The Community for Technology Leaders
Green Image
<p><b>Abstract</b>—This paper presents a data layout optimization technique for sequential and parallel programs based on the theory of hyperplanes from linear algebra. Given a program, our framework automatically determines suitable memory layouts that can be expressed by hyperplanes for each array that is referenced. We discuss the cases where data transformations are preferable to loop transformations and show that under certain conditions a loop nest can be optimized for perfect spatial locality by using data transformations. We argue that data transformations can also optimize spatial locality for some arrays without distorting temporal/spatial locality exhibited by others. We divide the problem of optimizing data layout into two independent subproblems: 1) determining optimal static data layouts, and 2) determining data transformation matrices to implement the optimal layouts. By postponing the determination of the transformation matrix to the last stage, our method can be adapted to compilers with different default layouts. We then present an algorithm that considers optimizing parallelism and spatial locality simultaneously. Our results on eight programs on two distributed shared-memory multiprocessors, the Convex Exemplar SPP-2000 and the SGI Origin 2000, show that the layout optimizations are effective in optimizing spatial locality and parallelism.</p>
Data reuse, locality optimizations, spatial locality, memory performance, parallelism, array restructuring.
Mahmut Kandemir, Alok Choudhary, Prithviraj Banerjee, J. Ramanujam, Nagaraj Shenoy, "A Linear Algebra Framework for Automatic Determination of Optimal Data Layouts", IEEE Transactions on Parallel & Distributed Systems, vol. 10, no. , pp. 115-135, February 1999, doi:10.1109/71.752779
99 ms
(Ver )