loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Workshop 4
Parallelization of the NAS Conjugate Gradient Benchmark Using the Global Arrays Shared Memory Programming Model
Denver, Colorado
April 04-April 08
ISBN: 0-7695-2312-9
Yeliang Zhang, University of Arizona
Vinod Tipparaju, Pacific Northwest National Laboratory
Jarek Nieplocha, Pacific Northwest National Laboratory
Salim Hariri, University of Arizona
The NAS Conjugate Gradient (CG) benchmark is an important scientific kernel used to evaluate machine performance and compare characteristics of different programming models. Global Arrays (GA) toolkit supports a shared memory programming paradigm and offers the programmer control over the distribution and locality that are important for optimizing performance on scalable architectures. In this paper, we describe and compare two different parallelization strategies of the CG benchmark using GA and report performance results on a shared-memory system as well as on a cluster. Performance benefits of using shared memory for irregular/sparse computations have been demonstrated before in the context of the CG benchmark using OpenMP. Similarly, the GA implementation outperforms the standard MPI implementation on shared memory system, in our case the SGI Altix. However, with GA these benefits are extended to distributed memory systems and demonstrated on a Linux cluster with Myrinet.
Citation:
Yeliang Zhang, Vinod Tipparaju, Jarek Nieplocha, Salim Hariri, "Parallelization of the NAS Conjugate Gradient Benchmark Using the Global Arrays Shared Memory Programming Model," ipdps, vol. 5, pp.177a, 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Workshop 4, 2005
Usage of this product signifies your acceptance of the Terms of Use.