Sixth IEEE International Conference on Cluster Computing (CLUSTER'04)
Implementing parallel conjugate gradient on the EARTH multithreaded architecture
San Diego, CA, USA
September 20-September 23
ISBN: 0-7803-8694-9
Fei Chen, Dept. of Electr. & Comput. Eng., Delaware Univ., Newark, DE, USA
K.B. Theobald, Dept. of Electr. & Comput. Eng., Delaware Univ., Newark, DE, USA
G.R. Gao, Dept. of Electr. & Comput. Eng., Delaware Univ., Newark, DE, USA
Conjugate gradient (CG) is one of the most popular iterative approaches to solving large sparse linear systems of equations. This work reports a parallel implementation of CG on clusters with EARTH multithreaded runtime support. Interphase and intraphase communication costs are balanced using a two-dimensional blocking method, minimizing overall communication costs. EARTH'S adaptive, event-driven multithreaded execution model gives additional opportunities to overlap communication and computation to achieve even better scalability. Experiments on a large Beowulf cluster with gigabit Ethernet show notable improvements over other parallel CG implementations. For example, with the NAS CG benchmark problem size Class C, our implementation achieved a speedup of 41 on a 64-node cluster, compared to 13 for the MPl-based NAS version. The results demonstrate that the combination of the two-dimensional blocking method and the EARTH architectural runtime support helps to compensate for the low communications bandwidth common to most clusters.
Citation:
Fei Chen, K.B. Theobald, G.R. Gao, "Implementing parallel conjugate gradient on the EARTH multithreaded architecture," cluster, pp.459-469, Sixth IEEE International Conference on Cluster Computing (CLUSTER'04), 2004