• Publication
  • 1992
  • Issue No. 4 - July
  • Abstract - Performance Measurement and Trace Driven Simulation of Parallel CAD and Numeric Applications on a Hypercube Multicomputer
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Performance Measurement and Trace Driven Simulation of Parallel CAD and Numeric Applications on a Hypercube Multicomputer
July 1992 (vol. 3 no. 4)
pp. 451-464
The performance evaluation, workload characterization, and trace-driven simulation of a hypercube multicomputer running realistic workloads are presented. Eleven representative parallel applications were selected as benchmarks. Software monitoring techniques were then used to collect execution traces. Based on the measurement results, both the computation and communication behavior of these parallel programs were investigated. The various time interval distributions were modeled by statistical functions which were verified by a nonlinear regression technique using the empirical data. The temporal and spatial localities of message destinations were also studied. A model for the temporal locality of message length was introduced and used to analyze the communication traces. A trace-drive simulation environment, which uses the communication patterns of the parallel programs as inputs, was developed to study the behavior of the communication hardware under real workload. Simulation results on DMA and link utilizations are reported.

[1] C. L. Seitz, "The Cosmic Cube,"Commun. ACM, pp. 22-33, Jan. 1985.
[2] J. Tuazon, J. Peterson, and M. Pniel, "Mark IIIfp hypercube concurrent processor architecture," inProc. 3rd Conf. Hypercube Concurrent Comput. Appl., Pasadena, CA Jan. 1988, pp. 71-80.
[3] P. Close, "The iPSC/2 Node Architecture,"Proc. Third Conf. Hypercube Concurrent Computers and Applications, Jan. 1988, SIAM.
[4] J.-M. Hsu and P. Banerjee, "Performance measurement and trace driven simulation of parallel CAD and numeric applications on a hypercube multicomputer," inProc. 17th Int. Symp. Comput. Architecture, Seattle, WA, May 1990, pp. 260-269.
[5] P. Kermani and L. Kleinrock, "Virtual cut-through: A new computer communication switching technique,"Comput. Networks, vol. 3, pp. 267-286, North-Holland, Sept. 1979.
[6] S. H. Bokhari, "Partitioning problems in parallel, pipelined, and distributed computing,"IEEE Trans. Comput., vol. 37, pp. 48-57, Jan. 1988.
[7] R. Mirchandaney, D. Towsley, and J. A. Stankovic, "Analysis of the effects of delays on load sharing,"IEEE Trans. Comput., vol. 38, pp. 1513-1525, Nov. 1989.
[8] D. C. Grunwald and D. A. Reed, "Networks for parallel processors: Measurements and prognostications," inProc. Third Conf. Hypercube Concurrent Comput. Appl., vol. I, 1988, pp. 610-619.
[9] G. Buzzard and T. Mudge, "High performance hypercube communication," inProc. 3rd Conf. Hypercube Concurrent Comput. Appl., Pasadena, CA, Jan. 1988, pp. 600-609.
[10] M. C. Hsueh, R. K. Iyer, and K. S. Trivedi, "Performability modeling based on real data: A case study,"IEEE Trans. Comput., vol. 37, pp. 478-484, Apr. 1988.
[11] M. V.-S. Devarakonda, "File usage analysis and resource usage prediction: A measurement-based study," Tech. Rep. CSG-79, Coordinated Science Lab., Univ. of Illinois, Urbana, IL, Dec. 1987.
[12] D. A. Reed and R. M. Fujimoto,Multicomputer Networks, Message-Based Parallel Processing. Cambridge, MA: MIT Press, 1987.
[13] R. L. Mattson, J. Gecsei, D. R. Slutz, and I. L. Traiger, "Evaluation techniques for storage hierarchies,"IBM Syst. J., vol 9, 1970.
[14] J. Sargent and P. Banerjee, "A parallel row-based algorithm for standard cell placement with integrated error control," inProc. 26th Design Automation Conf., Las Vegas, NV, June 1989, pp. 590-594.
[15] R. J. Brouwer and P. Banerjee, "A parallel simulated annealing algorithm for channel routing on a hypercube multiprocessor," inProc. IEEE Int. Conf. Comput. Design, Rye Brook, NY, Oct. 1988, pp. 4-7.
[16] K. P. Belkhale and P. Banerjee, "PACE2: An improved parallel VLSI extractor with parametric extraction," inProc. Int. Conf. Comput.-Aided Design, Santa Clara, CA, Nov. 1989, pp. 526-530.
[17] S. Patil and P. Banerjee, "A Parallel Branch-and-Bound Algorithm for Test Generation,"Proc. 26th ACM/IEEE Design Automation Conf., CS Press, Los Alamitos, Calif., Order No. 1961, June 1989, pp. 339-334.
[18] S. Patil and P. Banerjee, "Fault Partitioning Issues in an Integrated Parallel Test Generation/Fault Simulation Environment,"Proc, 1989 Int'l Test Conf., CS Press, Los Alamitos, Calif., Order No. 1962, 1989, pp. 718-726.
[19] G. H. Golub and C. F. Van Loan,Matrix Computations. Baltimore, MD: John Hopkins Univ. Press, 1983.
[20] A. Pothen, S. Jha, and U. Vemulapati, "Orthogonal factorization on a distributed memory multiprocessor," inProc. 2nd SIAM Conf. Hypercube Comput. Appl., 1987, pp. 587-596.
[21] M. Foxet al., Solving Problems on Concurrent Processors, vol. 1. Englewood Cliffs, NJ: Prentice-Hall, 1988.
[22] G. A. Geist and M. T. Heath, "Matrix factorization on a hypercube multiprocessor," inProc. SIAM 1st Conf. Hypercube Multiprocessors, Knoxville, TN, Aug. 1985.
[23] C. B. Stunkel, "Linear optimization via message-based parallel processing," inProc. 1988 Int. Conf. Parallel Processing, vol. 3, Aug. 1988, pp. 264-271.
[24] R. A. Fatoohi and C. E. Grosch, "Implementation and analysis of a Navier-Stokes algorithm on parallel computers," inProc. 1988 Int. Conf. Parallel Processing, vol. 3, Aug. 1988, pp. 235-242.
[25] J. J. Dongarra and C. B. Moler, EISPACK--A package for solving matrix Eigenvalue problems," inSources and Production of Mathematical Software, W. Cowell, Ed. Englewood Cliffs, NJ: Prentice Hall, 1984, pp. 68-87.
[26] P. Pierce, "The NX/2 Operating System,"Proc. Third Conf. Hypercube Concurrent Computers and Applications, ACM Press, 1988, pp. 384-390.
[27] D. C. Rudolph, "Performance instrumentation for the Intel IPSC/2," Tech. Rep. UIUCDCS-R-89-1524, Dep. Comput. Sci., Univ. of Illinois, Urbana, IL, July 1989.
[28] J.-M. Hsu and P. Banerjee, "Hardware support for message routing in a distributed memory multicomputer," inProc. 1990 Int. Conf. Parallel Processingvol. 1, St. Charles, IL, Aug. 1990, pp. 508-515.
[29] J.M. Hsu and P. Banerjee, "A message passing coprocessor for distributed memory multicomputers," inProc. Supercomput. '90, New York, NY, Nov. 1990.
[30] SAS Institute Inc.,SAS User's Guide: Statistics, Version 5, Cary, NC, 1985.
[31] H. Schwetman,CSIM Reference Manual (Revision 13), Microelectronics and Computer Tech. Corp., Austin, TX, May 1988.
[32] S. F. Nugent, "The iPSC/2 Direct-Connect communications technology," inProc. Third Conf. Hypercube Comput. Appl., Pasadena, CA, Jan. 1988, pp. 56-60.

Index Terms:
Index Termssoftware monitoring; spatial locality; parallel CAD; hypercube multicomputer; performance evaluation; workload characterization; realistic workloads; benchmarks; execution traces; parallel programs; time interval distributions; statistical functions; nonlinear regression technique; message destinations; temporal locality; message length; trace-drive simulation environment; DMA; link utilizations; digital simulation; hypercube networks; parallel programming; performance evaluation; statistical analysis
Citation:
J.M. Hsu, P. Banerjee, "Performance Measurement and Trace Driven Simulation of Parallel CAD and Numeric Applications on a Hypercube Multicomputer," IEEE Transactions on Parallel and Distributed Systems, vol. 3, no. 4, pp. 451-464, July 1992, doi:10.1109/71.149963
Usage of this product signifies your acceptance of the Terms of Use.