loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
2009 18th International Conference on Parallel Architectures and Compilation Techniques
Data Layout Transformation for Enhancing Data Locality on NUCA Chip Multiprocessors
Raleigh, North Carolina, USA
September 12-September 16
ISBN: 978-0-7695-3771-9
With increasing numbers of cores, future CMPs (Chip Multi-Processors) are likely to have a tiled architecture with a portion of shared L2 cache on each tile and a bank-interleaved distribution of the address space. Although such an organization is effective for avoiding access hot-spots, it can cause a significant number of non-local L2 accesses for many commonly occurring regular data access patterns. In this paper we develop a compile-time framework for data locality optimization via data layout transformation. Using a polyhedral model, the program's localizability is determined by analysis of its index set and array reference functions, followed by non-canonical data layout transformation to reduce non-local accesses for localizable computations. Simulation-based results on a 16-core 2D tiled CMP demonstrate the effectiveness of the approach. The developed program transformation technique is also useful in several other data layout transformation contexts.
Index Terms:
Data Layout Optimization, Polyhedral Model, NUCA Cache
Citation:
Qingda Lu, Christophe Alias, Uday Bondhugula, Thomas Henretty, Sriram Krishnamoorthy, J. Ramanujam, Atanas Rountev, P. Sadayappan, Yongjian Chen, Haibo Lin, Tin-fook Ngai, "Data Layout Transformation for Enhancing Data Locality on NUCA Chip Multiprocessors," pact, pp.348-357, 2009 18th International Conference on Parallel Architectures and Compilation Techniques, 2009
Usage of this product signifies your acceptance of the Terms of Use.