10th Euromicro Workshop on Parallel, Distributed and Network-based Processing (EUROMICRO-PDP 2002)
Towards the Design of an Automatically Tuned Linear Algebra Library
Canary Islands, Spain
January 09-January 11
ISBN: 0-7695-1444-8
In this work we propose the architecture of an automatically tuned linear algebra library, which is composed by a set of linear algebra routines along with their installation routines. During the installation process on a system, the linear algebra routines will be tuned automatically to the system conditions: hardware characteristics and basic libraries used in the linear algebra routines. The design methodology is analysed with a block LU factorisation. Variants for a sequential and parallel version of this routine on a logical rectangular mesh of processors are considered. An analytical model of the algorithm is developed as the basis of our methodology, and the behaviour of the algorithm is analysed with message-passing using MPI on several platforms: Network of SUN workstations, SGI Origin 2000 and IBM SP2, and with different basic linear algebra libraries: reference BLAS, machine-specific BLAS and ATLAS. The experiments show that it is possible to make a good automatic choice of configurable parameters of the linear algebra routines during the installation process. The average execution time of the Linear Algebra Routine is reduced by about 15% with respect to the non-tuned version.
Index Terms:
automatic tuning, linear algebra software, parallel computing
Citation:
Javier Cuenca, Domingo Gimenez, José Gonzalez, "Towards the Design of an Automatically Tuned Linear Algebra Library," pdp, pp.0201, 10th Euromicro Workshop on Parallel, Distributed and Network-based Processing (EUROMICRO-PDP 2002), 2002