2014 26th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD) (2014)
Jussieu, Paris, France
Oct. 22, 2014 to Oct. 24, 2014
In this paper we describe how to efficiently exploit task parallelism for the solution of sparse linear systems on multithreaded processors via ILUPACK's multi-level preconditioned CG method. Using a pair of data structures, we capture the task dependencies that appear in the two most challenging operations in the method (calculation of the preconditioned and its application), passing this information to the OmpSs runtime which can then implement a correct and efficient schedule of the entire solver. Our results with high-end multicore platforms equipped with Intel and AMD processors report significant performance gains, demonstrating that OmpSs provides an efficient and close-to seamless means to leverage the concurrency in a complex scientific code like ILUPACK.
Vectors, Runtime, Concurrent computing, Parallel processing, Program processors, US Department of Transportation, Programming
J. Aliaga, R. Badia, M. Barreda, M. Bollhofer and E. Quintana-Orti, "Leveraging Task-Parallelism with OmpSs in ILUPACK's Preconditioned CG Method," 2014 26th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), Jussieu, Paris, France, 2014, pp. 262-269.