Proceedings of the 2001 International Conference on Parallel Architectures and Compilation Techniques
Barcelona, Spain
Sept. 8, 2001 to Sept. 12, 2001
ISBN: 0-7695-1363-8
pp: 243
Lawrence Rauchwerger, Texas A&M University
María Jesús Garzarán, Universidad de Zaragoza
Ye Zhang, University of Illinois at Urbana-Champaign
Alin Jula, Texas A&M University
Hao Yu, Texas A&M University
Josep Torrellas, University of Illinois at Urbana-Champaign
Milos Prvulovic, University of Illinois at Urbana-Champaign
ABSTRACT
Reductions are important and time-consuming operations in many scientific codes. Effective parallelization of reductions is a critical transformation for loop parallelization, especially for sparse, dynamic applications. Unfortunately, conventional reduction parallelization algorithms are not scalable. In this paper, we present new architectural support that significantly speeds up parallel reduction and makes it scalable in shared-memory multiprocessors. The required architectural changes are mostly confined to the directory controllers. Experimental results based on simulations show that the proposed support is very effective. While conventional software-only reduction parallelization delivers an average speedup of only 2.7 on 16 processors, our scheme delivers an average speedup of 7.6.
CITATION
Lawrence Rauchwerger, María Jesús Garzarán, Ye Zhang, Alin Jula, Hao Yu, Josep Torrellas, Milos Prvulovic, "Architectural Support for Parallel Reductions in Scalable Shared-Memory Multiprocessors", Proceedings of the 2001 International Conference on Parallel Architectures and Compilation Techniques, p. 243, 2001, doi:10.1109/PACT.2001.953304