
This Article  
 
Share  
Bibliographic References  
Add to:  
Digg Furl Spurl Blink Simpy Del.icio.us Y!MyWeb  
Search  
 
ASCII Text  x  
ChingHsien Hsu, ShengWen Bai, YehChing Chung, ChuSing Yang, "A Generalized BasicCycle Calculation Method for Efficient Array Redistribution," IEEE Transactions on Parallel and Distributed Systems, vol. 11, no. 12, pp. 12011216, December, 2000.  
BibTex  x  
@article{ 10.1109/71.895789, author = {ChingHsien Hsu and ShengWen Bai and YehChing Chung and ChuSing Yang}, title = {A Generalized BasicCycle Calculation Method for Efficient Array Redistribution}, journal ={IEEE Transactions on Parallel and Distributed Systems}, volume = {11}, number = {12}, issn = {10459219}, year = {2000}, pages = {12011216}, doi = {http://doi.ieeecomputersociety.org/10.1109/71.895789}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, }  
RefWorks Procite/RefMan/Endnote  x  
TY  JOUR JO  IEEE Transactions on Parallel and Distributed Systems TI  A Generalized BasicCycle Calculation Method for Efficient Array Redistribution IS  12 SN  10459219 SP1201 EP1216 EPD  12011216 A1  ChingHsien Hsu, A1  ShengWen Bai, A1  YehChing Chung, A1  ChuSing Yang, PY  2000 KW  Redistribution KW  generalized basiccycle calculation method KW  distributed memory multicomputers. VL  11 JA  IEEE Transactions on Parallel and Distributed Systems ER   
Abstract—In many scientific applications, dynamic array redistribution is usually required to enhance the performance of an algorithm. In this paper, we present a
[1] S. Chatterjee, J. Gilbert, F. Long, R. Schreiber, and S. Tseng, “Generating Local Adresses and Communication Sets for Data Parallel Programs,” J. Parallel and Distributed Computing, vol. 26,pp. 72–84, 1995.
[2] G. Fox, S. Hiranandani, K. Kennedy, C. Koelbel, U. Kremer, C.W. Tseng, and M. Wu, “FortranD Language Specification,” Technical Report TR91170, Dept. of Computer Science, Rice Univ., Dec. 1991.
[3] S.K.S. Gupta, S.D. Kaushik, C.H. Huang, and P. Sadayappan, “On the Generation of Efficient Data Communication for DistributedMemory Machines,” Proc. Int'l Computing Symp., pp. 504513, 1992.
[4] Y.C. Chung, C.H. Hsu, and S.W. Bai, “A BasicCycle Calculation Technique for Efficient Dynamic Data Redistribution,” IEEE Trans. Parallel and Distributed Systems, vol. 9, no. 4, Apr. 1998.
[5] J.J. Dongarra, R. Van De Geijn, and D.W. Walker, “A Look at Scalable Dense Linear Algebra Libraries,” Technical Report ORNL/TM12126 from Oak Ridge Nat'l Laboratory, Apr. 1992.
[6] S.K.S. Gupta, S.D. Kaushik, C.H. Huang, and P. Sadayappan, “On Compiling Array Expressions for Efficient Execution on DistributedMemory Machines,” J. Parallel and Distributed Computing, vol. 32, pp. 155172, 1996.
[7] High Performance Fortran Forum, “High Performance Fortran Language Specification (version 1.1),” Rice Univ., Nov. 1994.
[8] S. Hiranandani, K. Kennedy, J. MellorCrammey, and A. Sethi, “Compilation Technique for BlockCyclic Distribution,” Proc. ACM Int'l Conf. Supercomputing, pp. 392403, July 1994.
[9] E. Kalns and L. Ni, “Processor Mapping Techniques towards Efficient Data Redistribution,” IEEE Trans. Parallel and Distributed Systems, vol. 12, no. 6,pp. 1,234–1,247, 1995.
[10] E.T. Kalns and L.M. Ni,“DaReL: A portable data redistribution library for distributedmemory machines,” Proc. 1994 Scalable Parallel Libraries Conf. 2, Oct. 1994.
[11] S.D. Kaushik, C.H. Huang, R.W. Johnson, and P. Sadayappan, “An Approach to CommunicationEfficient Data Redistribution,” Proc. 1994 ACM Int'l Conf. Supercomputing, pp. 364373, June 1994.
[12] S.D. Kaushik, C.H. Huang, J. Ramanujam, and P. Sadayappan, “MultiPhase Array Redistribution: Modeling and Evaluation,” Proc. Int'l Parallel Processing Symp., 1995.
[13] S.D. Kaushik, C.H. Huang, and P. Sadayappan, “Efficient Index Set Generation for Compiling HPF Array Statements on DistributedMemory Machines,” J. Parallel and Distributed Computing, vol. 38, pp. 237247, 1996.
[14] K. Kennedy, N. Nedeljkovic, and A. Sethi, “Efficient Address Generation for BlockCyclic Distribution,” Proc. Int'l Conf. Supercomputing, pp. 180184, July 1995.
[15] C. Koelbel, “CompilerTime Generation of Communication for Scientific Programs,” Supercomputing '91, pp. 101110, Nov. 1991.
[16] P.Z. Lee and W.Y. Chen, “Compiler Techniques for Determining Data Distribution and Generating Communication Sets on DistributedMemory Multicomputers,” Proc. 29th IEEE Hawaii Int'l Conf. System Sciences, pp. 537546, Jan. 1996.
[17] Y.W. Lim, P.B. Bhat, and V.K. Prasanna, “Efficient Algorithms for BlockCyclic Redistribution of Arrays,” Proc. Eighth IEEE Symp. Parallel and Distributed Processing, pp. 7483, 1996.
[18] Y.W. Lim, N. Park, and V.K. Prasanna, “Efficient Algorithms for MultiDimensional BlockCyclic Redistribution of Arrays,” Proc. 26th Int'l Conf. Parallel Processing, pp. 234241, 1997.
[19] L. Prylli and B. Tourancheau, “Fast Runtime Block Cyclic Data Redistribution on Multiprocessors,” J. Parallel and Distributed Computing, vol. 45, 1997.
[20] S. Ramaswamy and P. Banerjee, "Automatic Generation of Efficient Array Redistribution Routines for Distributed Memory Multicomputers," Proc. Frontiers '95: The Fifth Symposium on the Frontiers of Massively Parallel Computation, pp. 342349,McLean, Va., Feb. 1995.
[21] S. Ramaswamy, B. Simons, and P. Banerjee, “Optimizations for Efficient Array Redistribution on Distributed Memory Multicomputers,” J. Parallel and Distributed Computing, vol. 38, no. 2, pp. 217228, Nov. 1996.
[22] J. Stichnoth,D. O’Hallaron,, and T. Gross,“Generating communication for array statements: Design, implementation, and evaluation,” J. of Parallel and Distributed Computing, vol. 21, no. 1, pp. 150159, 1994.
[23] R. Thakur,A. Choudhary,, and G. Fox,“Runtime array redistribution in HPF programs,” Proc. 1994 Scalable High Performance Computing Conf., pp. 309316, May 1994.
[24] R. Thakur, A. Choudhary, and J. Ramanujam, “Efficient Algorithms for Array Redistribution“ IEEE Trans. Parallel and Distributed Systems, vol. 7, no. 6 pp. 587594, June 1996.
[25] A. Thirumalai and J. Ramanujam, “HPF Array Statements: Communication Generation and Optimization,” Proc. Third Workshop Languages, Compilers and Runtime system for Scalable Computers, May 1995.
[26] A. Thirumalai and J. Ramanujam, “Efficient Computation of Address Sequences in DataParallel Programs Using Closed Forms for Basis Vectors,” J. Parallel and Distributed Computing, vol, 38, no. 2, pp. 188203, Nov. 1996.
[27] V. Van Dongen, C. Bonello, and C. Freehill, “High Performance C–Language Specification Version 0.8.9,” Technical Report CRIMEPPP94/0412, 1994.
[28] C. Van Loan, “Computational Frameworks for the Fast Fourier Transform,” SIAM, 1992.
[29] D.W. Walker and S.W. Otto, “Redistribution ofBLOCKCYCLICData Distributions Using MPI,” Concurrency: Practice and Experience, vol. 8, no. 9, pp. 707728, Nov. 1996.
[30] A. Wakatani and M. Wolfe, “A New Approach to Array Redistribution: Strip Mining Redistribution,” Proc. Parallel Architectures and Languages Europe, July 1994.
[31] A. Wakatani and M. Wolfe, “Optimization of Array Redistribution for Distributed Memory Multicomputers,” Parallel Computing, vol. 21, no. 9, pp. 14851490, Sept. 1995.