Issue No.01 - January (1998 vol.9)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/71.655238
<p><b>Abstract</b>—This paper presents the results of the <it>Cedar Hand-Parallelization Experiment</it>, conducted from 1989 through 1992, within the Center for Supercomputing Research and Development (CSRD) at the University of Illinois. In this experiment, we manually transformed the Perfect Benchmarks® into parallel program versions. In doing so, we used techniques that may be automated in an optimizing compiler. We then ran these programs on the Cedar multiprocessor (built at CSRD during the 1980s) and measured the speed improvement due to each technique.</p><p>The results presented here extend the findings previously reported in [<ref rid="bibl000511" type="bib">11</ref>]. The techniques credited most for the performance gains include array privatization, parallelization of reduction operations, and the substitution of generalized induction variables. All these techniques can be considered extensions of transformations that were available in vectorizers and commercial restructuring compilers of the late 1980s. We applied these transformations by hand to the given programs, in a mechanical manner, similar to that of a parallelizing compiler. Because of our success with these transformations, we believed that it would be possible to implement many of these techniques in a new parallelizing compiler. Such a compiler has been completed in the meantime and we show preliminary results.</p>
Program parallelization, parallelization techniques, restructuring compilers, performance evaluation.
Jay Hoeflinger, Rudolf Eigenmann, "On the Automatic Parallelization of the Perfect Benchmarks®", IEEE Transactions on Parallel & Distributed Systems, vol.9, no. 1, pp. 5-23, January 1998, doi:10.1109/71.655238