loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Fourth IEEE International Conference on Cluster Computing (CLUSTER'02)
Mixed Mode Matrix Multiplication
Chicago, Illinois
September 23-September 26
ISBN: 0-7695-1745-5
Meng-Shiou Wu, Ames Laboratory — Iowa State University
Srinivas Aluru, Ames Laboratory — Iowa State University
Ricky A. Kendall, Ames Laboratory — Iowa State University
In modern clustering environments where the memory hierarchy has many layers (distributed memory, shared memory layer, cache, …), an important question is how to fully utilize all available resources and identify the most dominant layer in certain computation. When combining algorithms on all layers together, what would be the best method to get the best performance out of all the resources we have? Mixed mode programming model that uses thread programming on the shared memory layer and message passing programming on the distributed memory layer is a method that many researchers are using to utilize the memory resources. In this paper, we take an algorithmic approach that uses matrix multiplication as a tool to show how cache algorithms affect the performance of both shared memory and distributed memory algorithms. We show that with good underlying cache algorithm, overall performance is stable. When underlying cache algorithm is bad, superlinear speedup may occur, and increasing number of threads may also improve performance.
Citation:
Meng-Shiou Wu, Srinivas Aluru, Ricky A. Kendall, "Mixed Mode Matrix Multiplication," cluster, pp.195, Fourth IEEE International Conference on Cluster Computing (CLUSTER'02), 2002
Usage of this product signifies your acceptance of the Terms of Use.