
This Article  
 
Share  
Bibliographic References  
Add to:  
Digg Furl Spurl Blink Simpy Del.icio.us Y!MyWeb  
Search  
 
ASCII Text  x  
T.H. Tzen, L.M. Ni, "Dependence Uniformization: A Loop Parallelization Technique," IEEE Transactions on Parallel and Distributed Systems, vol. 4, no. 5, pp. 547558, May, 1993.  
BibTex  x  
@article{ 10.1109/71.224217, author = {T.H. Tzen and L.M. Ni}, title = {Dependence Uniformization: A Loop Parallelization Technique}, journal ={IEEE Transactions on Parallel and Distributed Systems}, volume = {4}, number = {5}, issn = {10459219}, year = {1993}, pages = {547558}, doi = {http://doi.ieeecomputersociety.org/10.1109/71.224217}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, }  
RefWorks Procite/RefMan/Endnote  x  
TY  JOUR JO  IEEE Transactions on Parallel and Distributed Systems TI  Dependence Uniformization: A Loop Parallelization Technique IS  5 SN  10459219 SP547 EP558 EPD  547558 A1  T.H. Tzen, A1  L.M. Ni, PY  1993 KW  Index Termsdata dependence uniformisation; loop parallelization; doubly nested loop; irregulardependence constraints; dependence constraints; synchronization method; parallelalgorithms; parallel programming VL  4 JA  IEEE Transactions on Parallel and Distributed Systems ER   
Data dependence uniformization, a method for overcoming the difficulties in parallelizing a doubly nested loop with irregular dependence constraints is proposed. This approach is based on the concept of vector decomposition. A simple set of basic dependences is developed from which all dependence constraints can be composed. The set of basic dependences is added to every iteration to replace all original dependences so that the dependence constraints become uniform. An efficient synchronization method ispresented to obey the uniform dependence constraints in every iteration.
[1] P. Tang and P.C. Yew, "Processor selfscheduling for multiplenested parallel loops," inProc. 1986 Int. Conf. Parallel Processing, Aug. 1986, pp. 528535.
[2] C. Polychronopoulos and D. Kuck, "Guided selfscheduling: A practical scheduling scheme for parallel supercomputers,"IEEE Tran. Comput., 1987.
[3] Z. Fang, P.C. Yew, P. Tang, and C. Q. Zhu, "Dynamic processor selfscheduling for general parallel nested loops,"IEEE Trans. Comput., vol. 39, pp. 919929, July 1990.
[4] T. H. Tzen and L. M. Ni, "Dynamic loop scheduling for sharedmemory multiprocessors," inProc. 1991 Int. Conf. Parallel Processing, vol. II, Aug. 1991, pp. 247250.
[5] T. H. Tzen and L. M. Ni, "Trapezoid selfscheduling: A practical scheduling scheme for parallel compilers,"IEEE Trans. Parallel Distributed Syst., vol. 4, no. 1, pp. 8798, Jan. 1993.
[6] R. Cytron, "Doacross: Beyond vectorization for multiprocessors," inProc. 1988 Int. Conf. Parallel Processing, Aug. 1986, pp. 836844.
[7] S. P. Midkiff and D. A. Padua, "Compiler algorithms for synchronization,"IEEE Trans. Comput., vol. C36, no. 12, pp. 14851495, Dec. 1987.
[8] M. Wolfe,Optimizing Supercompilers for Supercomputers. Cambridge MA: MIT Press, 1989.
[9] The Parallel Computing Forum,PCF Fortran: Language Definition, revision 1.5 ed., Aug. 1990.
[10] D. J. Kuck, Y. Muraoka, and S. C. Chen, "On the number of operations simultaneously executable in Fortranlike programs and their resulting speedup,"IEEE Trans. Comput., vol. C21, pp. 12931310, Dec. 1972.
[11] L. Lamport, "The parallel execution of DO loops,"Commun. ACM, vol. 17, no. 2, pp. 8393, Feb. 1974.
[12] M. J. Wolfe, "Loop skewing: The wavefront method revisited,"Int. J. Parallel Programming, vol. 15, no. 4, pp. 279293, Aug. 1986.
[13] Z. Li, P.C. Yew, and C.Q. Zhu, "An efficient data dependence analysis for parallelizing compilers,"IEEE Trans. Parallel Distributed Syst., vol. 1, pp. 2634, Jan. 1990.
[14] Z. Shen, Z. Li, and P. C. Yew, "An empirical study on array subscripts and data dependences," inProc. 1989 Int. Conf. Parallel Processing, vol. II, Aug. 1989, pp. 145150.
[15] R. H. Kuhn, "Optimization and interconnection complexity for: parallel processors, single stage networks, and decision trees," Ph.D. dissertation, Dep. Comput. Sci., Rep. 801009, Univ. of Illinois, UrbanaChampaign, IL, 1980.
[16] D. Malm,A Computer Laboratory Manual for Number Theory. Wentworth, NH: COMPress, 1980.
[17] U. Banerjee,Dependence Analysis for Supercomputing, Kluwer Academic Publishers, Norwell, Mass., 1988.
[18] T. H. Tzen, "Advanced loop parallelization: Dependence uniformization and trapezoid selfscheduling," Ph.D. dissertation, Michigan State Univ., Jan. 1992.
[19] G. F. Pfister and V. A. Norton, "Hotspot contention and combining in multistage interconnection networks,"IEEE Trans. Comput., vol. C34, pp. 934948, Oct. 1985.
[20] A. K. Nanda, H. Shing, T. H. Tzen, and L. M. Ni, "Resource contention in sharedmemory multiprocessors: A parameterized performance degradation model,"J. Parallel and Distributed Comput., vol. 12, pp. 313328, 1991.
[21] H. Su and P. Yew, "On data synchronization for multiprocessors," inProc. 16th Int. Symp. Comput. Architecture, May 1989, pp. 416423.
[22] BBN Advanced Computers Inc., Cambridge, MA,Overview of the Butterfly GP1000, Nov. 1988.
[23] BBN Advanced Computers Inc., Cambridge, MA,Mach 1000 Fortran Compiler Reference, revision 1.0 ed., Nov. 1988.