2004 International Conference on Parallel Processing (ICPP'04)
Global Partial Replicate Computation Partitioning
Montreal, Quebec, Canada
August 15-August 18
ISBN: 0-7695-2197-5
Early parallelizing compilers use the owner-computes rule to partition computation. Partial replication was later introduced to eliminate near-neighbor communication at the cost of some replicated computation, thereby improving performance and scalability. Existing work on partial replicate computation partitioning is limited to a single loop nest. In this paper, we present a formal description of the global partial replicate computation partitioning problem, a simplified cost model, and a heuristic solution. Experimental results show that the solution is superior to local approaches.
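The trade-off between owner-computes and partial replication can be illustrated with a small simulation. The sketch below is not the paper's algorithm, cost model, or heuristic; it is a minimal illustration under stated assumptions: two consecutive loops (b[i] = 2*a[i], then c[i] = b[i-1] + b[i+1]), a block distribution over p simulated processors, and a read-only input array a that is assumed to be available on every processor. All names (block_bounds, run, remote_reads, replicated_ops) are hypothetical.

```python
def block_bounds(n, p, rank):
    """Half-open index range [lo, hi) owned by `rank` under a block distribution."""
    size = (n + p - 1) // p
    lo = min(rank * size, n)
    hi = min(lo + size, n)
    return lo, hi


def run(n, p, replicate):
    """Simulate both loops on p processors.

    Returns (c, remote_reads, replicated_ops), where remote_reads counts
    elements of b that a processor needs but did not compute itself, and
    replicated_ops counts elements of b computed outside the owner's block.
    """
    a = list(range(n))  # assumption: read-only input, available everywhere
    c = [None] * n
    remote_reads = 0
    replicated_ops = 0
    for rank in range(p):
        lo, hi = block_bounds(n, p, rank)

        # Loop 1: b[i] = 2 * a[i].
        # Owner-computes covers [lo, hi); partial replication widens the
        # range by one element on each side (clipped to the array bounds).
        ext = 1 if replicate else 0
        b_lo, b_hi = max(lo - ext, 0), min(hi + ext, n)
        local_b = {i: 2 * a[i] for i in range(b_lo, b_hi)}
        replicated_ops += (b_hi - b_lo) - (hi - lo)

        # Loop 2: c[i] = b[i-1] + b[i+1], partitioned by the owner of c[i].
        for i in range(max(lo, 1), min(hi, n - 1)):
            for j in (i - 1, i + 1):
                if j not in local_b:
                    # Stands in for a message from the owner of b[j].
                    remote_reads += 1
                    local_b[j] = 2 * a[j]
            c[i] = local_b[i - 1] + local_b[i + 1]
    return c, remote_reads, replicated_ops


if __name__ == "__main__":
    for replicate in (False, True):
        _, reads, extra = run(16, 4, replicate)
        print(f"replicate={replicate}: remote_reads={reads}, replicated_ops={extra}")
```

In this toy setting, widening the first loop by one element per block boundary removes every remote read of b in the second loop, at the price of the same number of redundant multiplications. Choosing where to replicate across several loop nests at once, rather than within one, is the global problem the abstract refers to.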
Index Terms:
parallelizing compiler, partial replicate computation partitioning, data dependence, distributed memory systems, data parallel
Citation:
Yiran Wang, Li Chen, Zhao-Qing Zhang, "Global Partial Replicate Computation Partitioning," 2004 International Conference on Parallel Processing (ICPP'04), pp. 108-115, 2004