The Community for Technology Leaders
Green Image
<p><b>Abstract</b>—<it>Prefix computation</it> is a basic operation at the core of many important applications, e.g., some of the Grand Challenge problems, circuit design, digital signal processing, graph optimizations, and computational geometry.<ref fid="ft12571" type="fn">1</ref> In this paper, we present new and strict time-optimal parallel schedules for prefix computation with resource constraints under the concurrent-read-exclusive-write (CREW) parallel random access machine (PRAM) model. For prefix of <it>N</it> elements on <it>p</it> processors (<it>p</it> independent of <it>N</it>) when <it>N</it> > <it>p</it>(<it>p</it> + 1)/2, we derive <it>Harmonic Schedules</it> that achieve the <it>strict optimal time</it> (steps), <tmath>$\left\lceil {{{2\left( {N-1} \right)} \mathord{\left/ {\vphantom {{2\left( {N-1} \right)} {\left( {p+1} \right)}}} \right. \kern-\nulldelimiterspace} {\left( {p+1} \right)}}} \right\rceil $</tmath>. We also derive <it>Pipelined Schedules</it> that have better program-space efficiency than the Harmonic Schedule, yet only require a small constant number of steps more than the optimal time achieved by the Harmonic Schedule. Both the Harmonic Schedules and the Pipelined Schedules are simple and easy to implement. For prefix of <it>N</it> elements on <it>p</it> processors (<it>p</it> independent of <it>N</it>) where <it>N</it>≤<it>p</it>(<it>p</it> + 1)/2, the Harmonic Schedules are not time-optimal. For these cases, we establish an optimization method for determining key parameters of time-optimal schedules, based on connections between the structure of parallel prefix and Pascal's triangle. Using the derived parameters, we devise an algorithm to construct such schedules. For a restricted class of values of <it>N</it> and <it>p</it>, we prove that the constructed schedules are strictly time-optimal. We also give strong empirical evidence that our algorithm constructs strict time-optimal schedules for all cases where <it>N</it>≤<it>p</it>(<it>p</it> + 1)/2.</p>
Parallel prefix computation, scan operator resource-constrained parallel algorithms, strict time-optimal schedules, loop parallelization, loop-carried dependences, associative operations, tree-height reduction, Pascal's Triangle, combinatorial optimization.
Haigeng Wang, Kai-Yeng S. Siu, Alexandru Nicolau, "The Strict Time Lower Bound and Optimal Schedules for Parallel Prefix with Resource Constraints", IEEE Transactions on Computers, vol. 45, no. , pp. 1257-1271, November 1996, doi:10.1109/12.544482
172 ms
(Ver 3.3 (11022016))