ASCII Text
Guojing Cong, I-Hsin Chung, Hui-Fang Wen, David Klepacki, Hiroki Murata, Yasushi Negishi, Takao Moriyama, "A Systematic Approach toward Automated Performance Analysis and Tuning," IEEE Transactions on Parallel and Distributed Systems, vol. 23, no. 3, pp. 426-435, March 2012.
BibTeX
@article{10.1109/TPDS.2011.189,
  author = {Guojing Cong and I-Hsin Chung and Hui-Fang Wen and David Klepacki and Hiroki Murata and Yasushi Negishi and Takao Moriyama},
  title = {A Systematic Approach toward Automated Performance Analysis and Tuning},
  journal = {IEEE Transactions on Parallel and Distributed Systems},
  volume = {23},
  number = {3},
  issn = {1045-9219},
  year = {2012},
  pages = {426-435},
  doi = {http://doi.ieeecomputersociety.org/10.1109/TPDS.2011.189},
  publisher = {IEEE Computer Society},
  address = {Los Alamitos, CA, USA},
}
RefWorks Procite/RefMan/Endnote
TY  - JOUR
JO  - IEEE Transactions on Parallel and Distributed Systems
TI  - A Systematic Approach toward Automated Performance Analysis and Tuning
IS  - 3
SN  - 1045-9219
SP  - 426
EP  - 435
EPD - 426-435
A1  - Guojing Cong
A1  - I-Hsin Chung
A1  - Hui-Fang Wen
A1  - David Klepacki
A1  - Hiroki Murata
A1  - Yasushi Negishi
A1  - Takao Moriyama
PY  - 2012
KW  - Performance tuning
KW  - performance tool
VL  - 23
JA  - IEEE Transactions on Parallel and Distributed Systems
ER  -
[1] L. Adhianto, S. Banerjee, M. Fagan, M. Krentel, G. Marin, J. Mellor-Crummey, and N.R. Tallent, "HPCToolkit: Tools for Performance Analysis of Optimized Parallel Programs," Concurrency and Computation: Practice and Experience, vol. 22, pp. 685-701, http://hpctoolkit.org, Apr. 2010.
[2] C. Bastoul, "Code Generation in the Polyhedral Model Is Easier than You Think," Proc. Int'l Conf. Parallel Architectures and Compilation Techniques (PACT '04), pp. 7-16, Sept. 2004.
[3] A. Bhatele and G. Cong, "A Selective Profiling Tool: Towards Automatic Performance Tuning," Proc. Third Workshop System Management Techniques, Processes and Services (SMTPS '07), Mar. 2007.
[4] M. Burtscher, B.-D. Kim, J. Diamond, J. McCalpin, L. Koesterke, and J. Browne, "PerfExpert: An Easy-to-Use Performance Diagnosis Tool for HPC Applications," Proc. ACM/IEEE Int'l Conf. High Performance Computing, Networking, Storage and Analysis (SC '10), pp. 1-11, 2010.
[5] A. Chandramowlishwaran, K. Madduri, and R. Vuduc, "Diagnosis, Tuning, and Redesign for Multicore Performance: A Case Study of the Fast Multipole Method," Proc. ACM/IEEE Int'l Conf. High Performance Computing, Networking, Storage and Analysis (SC '10), pp. 1-12, 2010.
[6] C. Chen, J. Chame, and M.W. Hall, "CHiLL: A Framework for Composing High-Level Loop Transformations," technical report, Univ. of Southern California, 2008.
[7] W. Chen et al., "Using Profile Information to Assist Advanced Compiler Optimization and Scheduling," Advances in Languages and Compilers for Parallel Processing, vol. 757, pp. 31-48, Jan. 1993.
[8] G. Cong, I-H. Chung, H. Wen, D. Klepacki, H. Murata, Y. Negishi, and T. Moriyama, "A Holistic Approach towards Automated Performance Analysis and Tuning," Proc. 15th Int'l Euro-Par Conf. Parallel Processing, pp. 33-44, 2009.
[9] C. Ţăpuş, I-H. Chung, and J.K. Hollingsworth, "Active Harmony: Towards Automated Performance Tuning," Proc. ACM/IEEE Conf. Supercomputing (Supercomputing '02), pp. 1-11, 2002.
[10] L. DeRose, K. Ekanadham, J.K. Hollingsworth, and S. Sbaraglia, "SIGMA: A Simulator Infrastructure to Guide Memory Analysis," Proc. ACM/IEEE Conf. Supercomputing (Supercomputing '02), pp. 1-13, 2002.
[11] J.H. Ferziger and M. Peric, Computational Methods for Fluid Dynamics, third ed. Springer-Verlag, 2002.
[12] M. Geimer, F. Wolf, B.J.N. Wylie, E. Abraham, D. Becker, and B. Mohr, "The SCALASCA Performance Toolset Architecture," Proc. Int'l Workshop Scalable Tools for High-End Computing (STHEC), 2008.
[13] M. Gerndt and M. Ott, "Automatic Performance Analysis with Periscope," Concurrency and Computation: Practice and Experience, vol. 22, pp. 736-748, Apr. 2010.
[14] A. Hartono, B. Norris, and P. Sadayappan, "Annotation-Based Empirical Performance Tuning Using Orio," Proc. IEEE Int'l Symp. Parallel and Distributed Processing (IPDPS), pp. 1-11, 2009.
[15] IBM High Productivity Computing Systems Toolkit, http://www.alphaworks.ibm.com/tech/hpcst, 2011.
[16] A. MacNab, G. Vahala, P. Pavlo, L. Vahala, and M. Soe, "Lattice Boltzmann Model for Dissipative Incompressible MHD," Proc. 28th EPS Conf. Controlled Fusion and Plasma Physics, vol. 25A, pp. 853-856, 2001.
[17] A.D. Malony, S. Shende, R. Bell, K. Li, L. Li, and N. Trebon, "Advances in the Tau Performance System," Performance Analysis and Grid Computing, pp. 129-144, Kluwer Academic Publishers, 2004.
[18] B.P. Miller, M.D. Callaghan, J.M. Cargille, J.K. Hollingsworth, R.B. Irvin, K.L. Karavanic, K. Kunchithapadam, and T. Newhall, "The Paradyn Parallel Performance Measurement Tool," Computer, vol. 28, no. 11, pp. 37-46, Nov. 1995.
[19] V. Pillet, J. Labarta, T. Cortes, and S. Girona, "PARAVER: A Tool to Visualise and Analyze Parallel Code," Proc. WoTUG-18: Transputer and occam Developments, vol. 44, pp. 17-31, 1995.
[20] C.A. Schaefer, V. Pankratius, and W.F. Tichy, "Engineering Parallel Applications with Tunable Architectures," Proc. 32nd ACM/IEEE Int'l Conf. Software Eng. (ICSE '10), vol. 1, pp. 405-414, 2010.
[21] M. Schordan and D. Quinlan, "A Source-to-Source Architecture for User-Defined Optimizations," Proc. Joint Modular Languages Conf., pp. 214-223, 2003.
[22] R. Vuduc, J. Demmel, and K. Yelick, "OSKI: A Library of Automatically Tuned Sparse Matrix Kernels," Proc. SciDAC 2005, J. Physics: Conf. Series, 2005.
[23] H. Wen, S. Sbaraglia, S. Seelam, I. Chung, G. Cong, and D. Klepacki, "A Productivity Centered Tools Framework for Application Performance Tuning," QEST '07: Proc. Fourth Int'l Conf. Quantitative Evaluation of Systems, pp. 273-274, 2007.
[24] R. Whaley and J. Dongarra, "Automatically Tuned Linear Algebra Software (ATLAS)," Proc. Int'l Conf. Supercomputing (Supercomputing '98), www.netlib.org/utk/people/JackDongarra/PAPERS/atlas-sc98.ps, Nov. 1998.

