The Community for Technology Leaders
2013 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS) (2013)
Austin, TX, USA USA
Apr. 21, 2013 to Apr. 23, 2013
ISBN: 978-1-4673-5776-0
TABLE OF CONTENTS
Papers

Author index (PDF)

pp. 269-270

[Blank page] (PDF)

pp. 268

Table of contents (PDF)

pp. iv-vii

[Front matter] (PDF)

pp. i-iii

Sampled simulation of multi-threaded applications (PDF)

Trevor E. Carlson , Department of Electronics and Information Systems, Ghent University, Belgium
Wim Heirman , Department of Electronics and Information Systems, Ghent University, Belgium
Lieven Eeckhout , Department of Electronics and Information Systems, Ghent University, Belgium
pp. 2-12

XAMP: An eXtensible Analytical Model Platform (PDF)

Yipeng Wang , Department of Electrical and Computer Engineering, North Carolina State University, USA
Yan Solihin , Department of Electrical and Computer Engineering, North Carolina State University, USA
pp. 13-23

Synergistic coupling of SSD and hard disk for QoS-aware virtual memory (PDF)

Ke Liu , ECE Department, Wayne State University, Detroit, MI 48202, USA
Xuechen Zhang , ECE Department, Wayne State University, Detroit, MI 48202, USA
Kei Davis , CCS Division, Los Alamos National Laboratory, NM 87545, USA
Song Jiang , ECE Department, Wayne State University, Detroit, MI 48202, USA
pp. 24-33

Increasing the Transparent Page Sharing in Java (PDF)

Kazunori Ogata , IBM Research - Tokyo, 5-6-52 Toyosu, Koto-ku, Japan
Tamiya Onodera , IBM Research - Tokyo, 5-6-52 Toyosu, Koto-ku, Japan
pp. 34-44

Understanding the implications of virtual machine management on processor microarchitecture design (PDF)

Xiufeng Sui , Advanced Computer Systems Laboratory, Institute of Computing Technology, CAS, Beijing, China
Tao Sun , Computer Science and Technology Department, University of Science and Technology of China, Hefei, China
Tao Li , Dept. of Electrical and Computer Engineering, University of Florida, USA
Lixin Zhang , Advanced Computer Systems Laboratory, Institute of Computing, Technology, CAS, Beijing, China
pp. 45-53

An analytical framework for estimating TCO and exploring data center design space (PDF)

Damien Hardy , University of Cyprus, Cyprus
Marios Kleanthous , University of Cyprus, Cyprus
Isidoros Sideris , University of Cyprus, Cyprus
Ali G. Saidi , ARM, Cyprus
Emre Ozer , ARM, Cyprus
Yiannakis Sazeides , University of Cyprus, Cyprus
pp. 54-63

Interactive analysis of large distributed systems with scalable topology-based visualization (PDF)

Lucas Mello Schnorr , INRIA MESCAL Research Team, CNRS LIG Laboratory, University of Grenoble, France
Arnaud Legrand , INRIA MESCAL Research Team, CNRS LIG Laboratory, University of Grenoble, France
Jean-Marc Vincent , INRIA MESCAL Research Team, CNRS LIG Laboratory, University of Grenoble, France
pp. 64-73

McSimA+: A manycore simulator with application-level+ simulation and detailed microarchitecture modeling (PDF)

Jung Ho Ahn , Seoul National University, Korea
Sheng Li , Hewlett-Packard Labs, USA
Seongil O , Seoul National University, Korea
Norman P. Jouppi , Hewlett-Packard Labs, USA
pp. 74-85

A detailed and flexible cycle-accurate Network-on-Chip simulator (PDF)

Nan Jiang , Stanford University, USA
James Balfour , Google Inc., USA
Daniel U. Becker , Stanford University, USA
Brian Towles , D.E. Shaw, USA
William J. Dally , NVIDIA Research/Stanford University, USA
George Michelogiannakis , Lawrence Berkeley National Lab, USA
John Kim , KAIST, USA
pp. 86-96

How a single chip causes massive power bills GPUSimPow: A GPGPU power simulator (PDF)

Jan Lucas , Embedded Systems Architecture Department, TU Berlin, Einsteinufer 17, D-10587, Germany
Sohan Lal , Embedded Systems Architecture Department, TU Berlin, Einsteinufer 17, D-10587, Germany
Michael Andersch , Embedded Systems Architecture Department, TU Berlin, Einsteinufer 17, D-10587, Germany
Mauricio Alvarez-Mesa , Embedded Systems Architecture Department, TU Berlin, Einsteinufer 17, D-10587, Germany
Ben Juurlink , Embedded Systems Architecture Department, TU Berlin, Einsteinufer 17, D-10587, Germany
pp. 97-106

Parallel GPU architecture simulation framework exploiting work allocation unit parallelism (PDF)

Sangpil Lee , School of Electrical and Electronic Engineering, Yonsei University, Seoul, Republic of Korea
Won Woo Ro , School of Electrical and Electronic Engineering, Yonsei University, Seoul, Republic of Korea
pp. 107-117

Evaluating cache coherent shared virtual memory for heterogeneous multicore chips (PDF)

Blake A. Hechtman , Department of Electrical and Computer Engineering, Duke University, USA
Daniel J. Sorin , Department of Electrical and Computer Engineering, Duke University, USA
pp. 118-119

Exascale workload characterization and architecture implications (PDF)

Prasanna Balaprakash , Argonne National Laboratory, Mathematics and Computer Science Division, USA
Darius Buntinas , Argonne National Laboratory, Mathematics and Computer Science Division, USA
Anthony Chan , Argonne National Laboratory, Mathematics and Computer Science Division, USA
Apala Guha , Department of Computer Science, University of Chicago, USA
Rinku Gupta , Argonne National Laboratory, Mathematics and Computer Science Division, USA
Sri Hari Krishna Narayanan , Argonne National Laboratory, Mathematics and Computer Science Division, USA
Andrew A. Chien , Argonne National Laboratory, Mathematics and Computer Science Division, USA
Paul Hovland , Argonne National Laboratory, Mathematics and Computer Science Division, USA
Boyana Norris , Argonne National Laboratory, Mathematics and Computer Science Division, USA
pp. 120-121

EMERALD: Characterization of emerging applications and algorithms for low-power devices (PDF)

Chuanjun Zhang , Intel Labs, Intel Corporation, USA
Glenn G. Ko , University of Illinois, USA
Jung Wook Choi , University of Illinois, USA
Shang-nien Tsai , University of Illinois, USA
Minje Kim , University of Illinois, USA
Abner Guzman Rivera , University of Illinois, USA
Rob Rutenbar , University of Illinois, USA
Paris Smaragdis , University of Illinois, USA
Mi Sun Park , Penn State University, USA
Vijaykrishnan Narayanan , Penn State University, USA
Hongyi Xin , Carnegie Mellon University, USA
Onur Mutlu , Carnegie Mellon University, USA
Bin Li , Intel Labs, Intel Corporation, USA
Li Zhao , Intel Labs, Intel Corporation, USA
Mei Chen , Intel Labs, Intel Corporation, USA
Ravi Iyer , Intel Labs, Intel Corporation, USA
pp. 122-123

PAPI 5: Measuring power, energy, and the cloud (PDF)

Vincent M. Weaver , Electrical and Computer Engineering, University of Maine, USA
Dan Terpstra , Innovative Computing Lab, University of Tennessee, USA
Heike McCraw , Innovative Computing Lab, University of Tennessee, USA
Matt Johnson , Innovative Computing Lab, University of Tennessee, USA
Kiran Kasichayanula , Innovative Computing Lab, University of Tennessee, USA
James Ralph , Innovative Computing Lab, University of Tennessee, USA
John Nelson , Innovative Computing Lab, University of Tennessee, USA
Phil Mucci , Innovative Computing Lab, University of Tennessee, USA
Tushar Mohan , Minimal Metrics, USA
Shirley Moore , Computer and Computational Sciences, University of Texas at El Paso, USA
pp. 124-125

Energy efficiency of lossless data compression on a mobile device: An experimental evaluation (PDF)

Armen Dzhagaryan , Department of Electrical and Computer Engineering, The University of Alabama in Huntsville, U.S.A.
Aleksandar Milenkovic , Department of Electrical and Computer Engineering, The University of Alabama in Huntsville, U.S.A.
Martin Burtscher , Department of Computer Science, Texas State University-San Marcos, U.S.A.
pp. 126-127

Virtual Power Management simulation framework for computer systems (PDF)

Bishop Brock , IBM Corporation, USA
Srinivasan Ramani , IBM Corporation, USA
Ken Vu , IBM Corporation, USA
Heather Hanson , IBM Corporation, USA
Michael Floyd , IBM Corporation, USA
pp. 128-129

Characterizing the microarchitectural side effects of operating system calls (PDF)

Addison Mayberry , School of Computer Science, University of Massachusetts, Amherst, 01002, USA
Matthew Laquidara , School of Computer Science, University of Massachusetts, Amherst, 01002, USA
Charles Weems , School of Computer Science, University of Massachusetts, Amherst, 01002, USA
pp. 130-131

QTrace: An interface for customizable full system instrumentation (PDF)

Xin Tong , University of Toronto, Canada
Jack Luo , University of Toronto, Canada
Andreas Moshovos , University of Toronto, Canada
pp. 132-133

A statistical machine learning based modeling and exploration framework for run-time cross-stack energy optimization (PDF)

Changshu Zhang , Department of Electrical and Computer Engineering, University of North Carolina at Charlotte, USA
Arun Ravindran , Department of Electrical and Computer Engineering, University of North Carolina at Charlotte, USA
pp. 136-137

Use of simple analytic performance models for streaming data applications deployed on diverse architectures (PDF)

Jonathan C. Beard , Dept. of Computer Science and Engineering, Washington University in St. Louis, USA
Roger D. Chamberlain , Dept. of Computer Science and Engineering, Washington University in St. Louis, USA
pp. 138-139

A circuit-architecture co-optimization framework for evaluating emerging memory hierarchies (PDF)

Xiangyu Dong , Qualcomm Technology, Inc., USA
Norman P. Jouppi , Hewlett-Packard Labs, USA
Yuan Xie , Pennsylvania State University, USA
pp. 140-141

A mathematical hard disk timing model for full system simulation (PDF)

Benjamin S. Parsons , School of Electrical and Computer Engineering, Purdue University, West Lafayette, IN 47907, USA
Vijay S. Pai , School of Electrical and Computer Engineering, Purdue University, West Lafayette, IN 47907, USA
pp. 143-153

Wall-clock based synchronization: A parallel simulation technology for cluster systems (PDF)

Xiaodong Zhu , School of Computer Science and Technology, University of Science and Technology of China, Hefei, China
Junmin Wu , School of Computer Science and Technology, University of Science and Technology of China, Hefei, China
Guoliang Chen , School of Computer Science and Technology, University of Science and Technology of China, Hefei, China
Tao Li , Department of Electrical and Computer Engineering, University of Florida, Gainesville, USA
pp. 154-162

Performance analysis of broadcasting algorithms on the Intel Single-Chip Cloud Computer (PDF)

John Matienzo , Department of Electrical and Computer Engineering, University of Toronto, Ontario, Canada
Natalie Enright Jerger , Department of Electrical and Computer Engineering, University of Toronto, Ontario, Canada
pp. 163-172

Selecting benchmark combinations for the evaluation of multicore throughput (PDF)

Ricardo A. Velasquez , INRIA/IRISA, Rennes, France
Pierre Michaud , INRIA/IRISA, Rennes, France
Andre Seznec , INRIA/IRISA, Rennes, France
pp. 173-182

Pinpointing data locality bottlenecks with low overhead (PDF)

Xu Liu , Dept. of Computer Science MS 132, Rice University, P.O. Box 1892, Houston, TX 77251-1892, USA
John Mellor-Crummey , Dept. of Computer Science MS 132, Rice University, P.O. Box 1892, Houston, TX 77251-1892, USA
pp. 183-193

Power measurement techniques on standard compute nodes: A quantitative comparison (PDF)

Daniel Hackenberg , Center for Information Services and High Performance Computing (ZIH), Technische Universität Dresden - 01062, Germany
Thomas Ilsche , Center for Information Services and High Performance Computing (ZIH), Technische Universität Dresden - 01062, Germany
Robert Schone , Center for Information Services and High Performance Computing (ZIH), Technische Universität Dresden - 01062, Germany
Daniel Molka , Center for Information Services and High Performance Computing (ZIH), Technische Universität Dresden - 01062, Germany
Maik Schmidt , Center for Information Services and High Performance Computing (ZIH), Technische Universität Dresden - 01062, Germany
Wolfgang E. Nagel , Center for Information Services and High Performance Computing (ZIH), Technische Universität Dresden - 01062, Germany
pp. 194-204

Non-determinism and overcount on modern hardware performance counter implementations (PDF)

Vincent M. Weaver , Electrical and Computer Engineering, University of Maine, USA
Dan Terpstra , Innovative Computing Lab, University of Tennessee, USA
Shirley Moore , Computer and Computational Sciences, University of Texas at El Paso, USA
pp. 215-224

Characterizing scalar opportunities in GPGPU applications (PDF)

Zhongliang Chen , Department of Electrical andComputer Engineering, Northeastern University, Boston, MA 02115, USA
David Kaeli , Department of Electrical andComputer Engineering, Northeastern University, Boston, MA 02115, USA
Norman Rubin , NVIDIA Corporation, USA
pp. 225-234

Quantifying the energy efficiency of FFT on heterogeneous platforms (PDF)

Yash Ukidave , Department of Electrical & Computer Engineering, Northeastern University, Boston, USA
Amir Kavyan Ziabari , Department of Electrical & Computer Engineering, Northeastern University, Boston, USA
Perhaad Mistry , Department of Electrical & Computer Engineering, Northeastern University, Boston, USA
Gunar Schirner , Department of Electrical & Computer Engineering, Northeastern University, Boston, USA
David Kaeli , Department of Electrical & Computer Engineering, Northeastern University, Boston, USA
pp. 235-244

Evaluating STT-RAM as an energy-efficient main memory alternative (PDF)

Emre Kultursay , The Pennsylvania State University, USA
Mahmut Kandemir , The Pennsylvania State University, USA
Anand Sivasubramaniam , The Pennsylvania State University, USA
Onur Mutlu , Carnegie Mellon University, USA
pp. 256-267
93 ms
(Ver 3.3 (11022016))