loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Automatic Prefetch and Modulo Scheduling Transformations for the Cell BE Architecture
PrePrint
ISSN: 1045-9219
Nikola Vujic, Barcelona Supercomputing Center
Marc Gonzàlez Tallada, Technical University of Catalonia
Xavier Martorell, Barcelona Supercomputing Center
Eduard Ayguadé, Barcelona Supercomputing Center
Ease of programming is one of the main requirements for the broad acceptance of multi-core systems without hardware support for transparent data transfer between local and global memories. Software cache is a robust approach to provide the user with a transparent view of the memory architecture; but this software approach can suffer from poor performance. In this paper, we propose a hierarchical, hybrid software-cache architecture that targets enabling prefetch techniques. Memory accesses are classified at compile time into two classes: high-locality and irregular. Our approach then steers the memory references toward one of two specific cache structures optimized for their respective access pattern. The specific cache structures are optimized to enable high-level compiler optimizations to aggressively unroll loops, reorder cache references, and/or transform surrounding loops so as to practically eliminate the software cache overhead in the innermost loop. The cache design enables automatic prefetch and modulo scheduling transformations. Performance evaluation indicates that optimized software cache structures combined with the proposed prefetch techniques translate into speed-up between 10% and 20%. As a result of the proposed technique, we can achieve similar performance on the Cell BE processor as on a modern server-class multi-core such as the IBM PowerPC 970MP processor for a set of parallel NAS applications.
Index Terms:
Emerging technologies, Heterogeneous (hybrid) systems, Parallel Architectures, Multi-core/single-chip multiprocessors, Memory hierarchy
Citation:
Nikola Vujic, Marc Gonzàlez Tallada, Xavier Martorell, Eduard Ayguadé, "Automatic Prefetch and Modulo Scheduling Transformations for the Cell BE Architecture," IEEE Transactions on Parallel and Distributed Systems, 12 Jun. 2009. IEEE computer Society Digital Library. IEEE Computer Society, <http://doi.ieeecomputersociety.org/10.1109/TPDS.2009.97>
Usage of this product signifies your acceptance of the Terms of Use.