The Community for Technology Leaders
SC Conference (2011)
Seattle, Washington
Nov. 12, 2011 to Nov. 18, 2011
ISBN: 978-1-4503-0771-0
TABLE OF CONTENTS
Papers

SC 2011 Keynote (Abstract)

pp. i

First-principles calculations of electron states of a silicon nanowire with 100,000 atoms on the K computer (Abstract)

Yukihiro Hasegawa , Next-Generation Supercomputer R&D Center, Riken
Jun-Ichi Iwata , University of Tsukuba
Miwako Tsuji , University of Tsukuba
Daisuke Takahashi , University of Tsukuba
Atsushi Oshiyama , The University of Tokyo
Kazuo Minami , Next-Generation Supercomputer R&D Center, Riken
Taisuke Boku , University of Tsukuba
Fumiyoshi Shoji , Next-Generation Supercomputer R&D Center, Riken
Atsuya Uno , Next-Generation Supercomputer R&D Center, Riken
Motoyoshi Kurokawa , Next-Generation Supercomputer R&D Center, Riken
Hikaru Inoue , Technical Computing Solution Unit, Fujitsu Limited
Ikuo Miyoshi , Next Generation Technical Computing Unit, Fujitsu Limited
Mitsuo Yokokawa , Next-Generation Supercomputer R&D Center, Riken
pp. 1-11

Atomistic nanoelectronic device engineering with sustained performances up to 1.44 PFlop/s (Abstract)

Mathieu Luisier , Purdue University, West Lafayette, IN
Timothy B. Boykin , University of Alabama in Huntsville, Huntsville, AL
Gerhard Klimeck , Purdue University, West Lafayette, IN
Wolfgang Fichtner , Integrated Systems Laboratory, ETH Zürich, Zürich, Switzerland
pp. 1-11

Peta-scale phase-field simulation for dendritic solidification on the TSUBAME 2.0 supercomputer (Abstract)

Takashi Shimokawabe , Tokyo Institute of Technology, Ookayama, Meguro-ku, Tokyo, Japan
Takayuki Aoki , Tokyo Institute of Technology, Meguro-ku, Tokyo, Japan and Japan Science and Technology Agency, CREST
Tomohiro Takaki , Kyoto Institute of Technology, Gosyokaido-cyo, Matsugasaki, Sakyo-ku, Kyoto, Japan
Toshio Endo , Tokyo Institute of Technology, Meguro-ku, Tokyo, Japan and Japan Science and Technology Agency, CREST
Akinori Yamanaka , Tokyo Institute of Technology, Meguro-ku, Tokyo, Japan
Naoya Maruyama , Tokyo Institute of Technology, Meguro-ku, Tokyo, Japan and Japan Science and Technology Agency, CREST
Akira Nukada , Tokyo Institute of Technology, Meguro-ku, Tokyo, Japan and Japan Science and Technology Agency, CREST
Satoshi Matsuoka , Tokyo Institute of Technology, Meguro-ku, Tokyo, Japan and Japan Science and Technology Agency, CREST and National Institute of Informatics
pp. 1-11

Petaflop biofluidics simulations on a two million-core system (Abstract)

Massimo Bernaschi , CNR-IAC, Istituto Applicazioni, Calcolo, Consiglio Nazionale delle, Ricerche, Rome, Italy
Mauro Bisson , Harvard University, Cambridge, MA
Toshio Endo , Tokyo Institute of Technology, Tokyo, Japan
Satoshi Matsuoka , Tokyo Institute of Technology, Tokyo, Japan
Massimiliano Fatica , Nvidia Corp., Santa Clara, CA
Simone Melchionna , CNR-IPCF, Istituto Processi, Chimico-Fisici, Consiglio Nazionale delle, Ricerche, Rome, Italy
pp. 1-12

A new computational paradigm in multiscale simulations: application to brain blood flow (Abstract)

Leopold Grinberg , Brown University, Providence, RI
Joseph A. Insley , Argonne National Laboratory, Argonne, IL
Vitali Morozov , Argonne National Laboratory, Argonne, IL
Michael E. Papka , Argonne National Laboratory, Argonne, Illinois
George Em Karniadakis , Brown University, Providence, RI
Dmitry Fedosov , Institute of Complex Systems, FZ Juelich, Juelich, Germany
Kalyan Kumaran , Argonne National Laboratory, Argonne, Illinois
pp. 1-5

Optimizing symmetric dense matrix-vector multiplication on GPUs (Abstract)

Rajib Nath , University of California, San Diego
Stanimire Tomov , University of Tennessee, Knoxville
Tingxing "Tim" Dong , University of Tennessee, Knoxville
Jack Dongarra , University of Tennessee, Knoxville
pp. 1-10

Tiled QR factorization algorithms (Abstract)

Henricus Bouwmeester , University of Colorado Denver
Mathias Jacquelin , ENS Lyon, France
Julien Langou , University of Colorado Denver
Yves Robert , ENS Lyon, France
pp. 1-11

Parallel reduction to condensed forms for symmetric eigenvalue problems using aggregated fine-grained and memory-aware kernels (Abstract)

Azzam Haidar , University of Tennessee, Knoxville, TN
Hatem Ltaief , KAUST Supercomputing Laboratory, Thuwal, Saudi Arabia
Jack Dongarra , University of Tennessee, Knoxville, TN
pp. 1-11

Liszt: a domain specific language for building portable mesh-based PDE solvers (Abstract)

Zachary DeVito , Stanford University
Niels Joubert , Stanford University
Francisco Palacios , Stanford University
Stephen Oakley , Stanford University
Montserrat Medina , Stanford University
Mike Barrientos , Stanford University
Erich Elsen , Stanford University
Frank Ham , Stanford University
Alex Aiken , Stanford University
Karthik Duraisamy , Stanford University
Eric Darve , Stanford University
Juan Alonso , Stanford University
Pat Hanrahan , Stanford University
pp. 1-12

Simplified parallel domain traversal (Abstract)

Wesley Kendall , The University of Tennessee, Knoxville
Jingyuan Wang , The University of Tennessee, Knoxville
Melissa Allen , The University of Tennessee, Knoxville
Tom Peterka , Argonne National Laboratory
Jian Huang , The University of Tennessee, Knoxville
David Erickson , Oak Ridge National Laboratory
pp. 1-11

Physis: an implicitly parallel programming model for stencil computations on large-scale GPU-accelerated supercomputers (Abstract)

Naoya Maruyama , Tokyo Institute of Technology, Ookayama, Meguro-ku, Tokyo, Japan
Tatsuo Nomura , Google, Inc., Roppongi, Minato-ku, Tokyo, Japan
Kento Sato , Tokyo Institute of Technology, Ookayama, Meguro-ku, Tokyo, Japan
Satoshi Matsuoka , Tokyo Institute of Technology, Ookayama, Meguro-ku, Tokyo, Japan
pp. 1-12

CudaDMA: optimizing GPU memory bandwidth via warp specialization (Abstract)

Michael Bauer , Stanford University
Henry Cook , UC Berkeley
Brucek Khailany , NVIDIA Research
pp. 1-11

Dymaxion: optimizing memory access patterns for heterogeneous systems (Abstract)

Shuai Che , University of Virginia
Jeremy W. Sheaffer , University of Virginia
Kevin Skadron , University of Virginia
pp. 1-11

GROPHECY: GPU performance projection from CPU code skeletons (Abstract)

Jiayuan Meng , Argonne National Laboratory
Vitali A. Morozov , Argonne National Laboratory
Kalyan Kumaran , Argonne National Laboratory
Venkatram Vishwanath , Argonne National Laboratory
Thomas D. Uram , Argonne National Laboratory
pp. 1-11

Parallel random numbers: as easy as 1, 2, 3 (Abstract)

John K. Salmon , D. E. Shaw Research, New York, NY
Mark A. Moraes , D. E. Shaw Research, New York, NY
Ron O. Dror , D. E. Shaw Research, New York, NY
David E. Shaw , D. E. Shaw Research, New York, NY
pp. 1-12

Server-side I/O coordination for parallel file systems (Abstract)

Huaiming Song , Illinois Institute of Technology, Chicago, IL
Yanlong Yin , Illinois Institute of Technology, Chicago, IL
Xian-He Sun , Illinois Institute of Technology, Chicago, IL
Rajeev Thakur , Argonne National Laboratory, Argonne, IL
Samuel Lang , Argonne National Laboratory, Argonne, IL
pp. 1-11

QoS support for end users of I/O-intensive applications using shared storage systems (Abstract)

Xuechen Zhang , Wayne State University, Detroit, MI
Kei Davis , Los Alamos National Laboratory, Los Alamos, NM
Song Jiang , Wayne State University, Detroit, MI
pp. 1-12

Topology-aware data movement and staging for I/O acceleration on Blue Gene/P supercomputing systems (Abstract)

Venkatram Vishwanath , Argonne National Laboratory, Argonne, IL
Mark Hereld , Argonne National Laboratory, Argonne, IL
Vitali Morozov , Argonne National Laboratory, Argonne, IL
Michael E. Papka , Argonne National Laboratory, Argonne, IL
pp. 1-11

GreenSlot: scheduling energy consumption in green datacenters (Abstract)

Íñigo Goiri , UPC/BSC and Rutgers Univ.
Ryan Beauchea , Rutgers University
Kien Le , Rutgers University
Thu D. Nguyen , Rutgers University
Md. E. Haque , Rutgers University
Jordi Guitart , UPC/BSC
Jordi Torres , UPC/BSC
Ricardo Bianchini , Rutgers University
pp. 1-11

A 'cool' load balancer for parallel applications (Abstract)

Osman Sarood , University of Illinois at Urbana-Champaign, Urbana, IL
Laxmikant V. Kale , University of Illinois at Urbana-Champaign, Urbana, IL
pp. 1-11

Reducing electricity cost through virtual machine placement in high performance computing clouds (Abstract)

Kien Le , Rutgers University
Ricardo Bianchini , Rutgers University
Jingru Zhang , Rutgers University
Yogesh Jaluria , Rutgers University
Jiandong Meng , Rutgers University
Thu D. Nguyen , Rutgers University
pp. 1-12

Gyrokinetic toroidal simulations on leading multi- and manycore HPC systems (Abstract)

Kamesh Madduri , NERSC/CRD, Lawrence Berkeley National Laboratory, Berkeley
Khaled Z. Ibrahim , NERSC/CRD, Lawrence Berkeley National Laboratory, Berkeley
Samuel Williams , NERSC/CRD, Lawrence Berkeley National Laboratory, Berkeley
Eun-Jin Im , Kookmin University, Seoul, Korea
Stephane Ethier , Princeton Plasma Physics Laboratory, Princeton
John Shalf , NERSC/CRD, Lawrence Berkeley National Laboratory, Berkeley
Leonid Oliker , NERSC/CRD, Lawrence Berkeley National Laboratory, Berkeley
pp. 1-12

Unitary qubit lattice simulations of multiscale phenomena in quantum turbulence (Abstract)

George Vahala , William & Mary, Williamsburg, VA
Min Soe , Rogers State University, Claremore, OK
Bo Zhang , William & Mary, Williamsburg, VA
Jeffrey Yepez , Air Force Research Laboratories, Hanscom AFB, MA
Linda Vahala , Old Dominion University, Norfolk, VA
Jonathan Carter , Computing Sciences, Lawrence Berkeley National Laboratory MS, Berkeley, CA
Sean Ziegeler , High Performance Technologies, Inc., Reston, VA
pp. 1-11

An image compositing solution at scale (Abstract)

Kenneth Moreland , Sandia National Laboratories
Wesley Kendall , University of Tennessee, Knoxville
Tom Peterka , Argonne National Laboratory
Jian Huang , University of Tennessee, Knoxville
pp. 1-10
Papers

High-efficiency server design (Abstract)

Ali Heydari , Facebook
Harry Li , Facebook
Amir Michael , Facebook
Jacob Na , Facebook
Avery Nisbet , Facebook
Pierluigi Sarti , Facebook
pp. 1-27

Using the TOP500 to trace and project technology and architecture trends (Abstract)

Peter M. Kogge , Univ. of Notre Dame, Fitzpatrick Hall, Notre Dame, IN
Timothy J. Dysart , Univ. of Notre Dame, Fitzpatrick Hall, Notre Dame, IN
pp. 1-11

I/O streaming evaluation of batch queries for data-intensive computational turbulence (Abstract)

Kalin Kanov , Johns Hopkins University, Baltimore, Maryland
Eric Perlman , Johns Hopkins University, Baltimore, Maryland
Randal Burns , Johns Hopkins University, Baltimore, Maryland
Yanif Ahmad , Johns Hopkins University, Baltimore, Maryland
Alexander Szalay , Johns Hopkins University, Baltimore, Maryland
pp. 1-10

ISABELA-QA: query-driven analytics with ISABELA-compressed extreme-scale scientific data (Abstract)

Sriram Lakshminarasimhan , North Carolina State University, NC and Oak Ridge National Laboratory, Oak Ridge, TN
John Jenkins , North Carolina State University, NC and Oak Ridge National Laboratory, Oak Ridge, TN
Isha Arkatkar , North Carolina State University, NC and Oak Ridge National Laboratory, Oak Ridge, TN
Zhenhuan Gong , North Carolina State University, NC
Hemanth Kolla , Sandia National Laboratory, Livermore, CA
Seung-Hoe Ku , New York University, New York, NY
Stephane Ethier , Princeton Plasma Physics Laboratory, Princeton, NJ
Jackie Chen , Sandia National Laboratory, Livermore, CA
C. S. Chang , New York University, New York, NY
Scott Klasky , Oak Ridge National Laboratory, Oak Ridge, TN
Robert Latham , Argonne National Laboratory, Argonne, IL
Robert Ross , Argonne National Laboratory, Argonne, IL
Nagiza F. Samatova , North Carolina State University, NC and Oak Ridge National Laboratory, Oak Ridge, TN
pp. 1-11

FTI: high performance fault tolerance interface for hybrid systems (Abstract)

Leonardo Bautista-Gomez , Tokyo Institute of Technology, INRIA
Seiji Tsuboi , JAMSTEC
Dimitri Komatitsch , University of Toulouse
Franck Cappello , INRIA, University of Illinois
Naoya Maruyama , Tokyo Institute of Technology
Satoshi Matsuoka , Tokyo Institute of Technology
pp. 1-32

Checkpointing strategies for parallel jobs (Abstract)

Marin Bougeret , ENS Lyon, France
Henri Casanova , Univ. of Hawai'i at Manoa, Honolulu
Mikael Rabie , ENS Lyon, France
Yves Robert , ENS Lyon, France
Frédéric Vivien , INRIA, Lyon, France
pp. 1-11

BlobCR: efficient checkpoint-restart for HPC applications on IaaS clouds using virtual disk image snapshots (Abstract)

Bogdan Nicolae , INRIA Saclay, ?le-de-France, France
Franck Cappello , University of Illinois at Urbana Champaign
pp. 1-12

Fast implementation of DGEMM on Fermi GPU (Abstract)

Guangming Tan , Key Laboratory of Computer Architecture, Institute of Computing Technology
Linchuan Li , Key Laboratory of Computer Architecture, Institute of Computing Technology
Sean Triechle , Nvidia Corporation
Everett Phillips , Nvidia Corporation
Yungang Bao , Key Laboratory of Computer Architecture, Institute of Computing Technology
Ninghui Sun , Key Laboratory of Computer Architecture, Institute of Computing Technology
pp. 1-11

Scalable fast multipole methods on distributed heterogeneous architectures (Abstract)

Qi Hu , University of Maryland, College Park
Nail A. Gumerov , University of Maryland, College Park
Ramani Duraiswami , University of Maryland, College Park
pp. 1-12

Multi-science applications with single codebase - GAMER - for massively parallel architectures (Abstract)

Hemant Shukla , Lawrence Berkeley National laboratory, Berkeley
Hsi-Yu Schive , National Taiwan University, Taipei, Taiwan
Tak-Pong Woo , Soochow University, Taipei, Taiwan
Tzihong Chiueh , National Taiwan University, Taipei, Taiwan
pp. 1-11

Virtual I/O caching: dynamic storage cache management for concurrent workloads (Abstract)

Michael Frasca , Pennsylvania State University, University Park, Pennsylvania
Ramya Prabhakar , Pennsylvania State University, University Park, Pennsylvania
Padma Raghavan , Pennsylvania State University, University Park, Pennsylvania
Mahmut Kandemir , Pennsylvania State University, University Park, Pennsylvania
pp. 1-11

SCMFS: a file system for storage class memory (Abstract)

Xiaojian Wu , Texas A&M University
A. L. Narasimha Reddy , Texas A&M University
pp. 1-11

Optimized pre-copy live migration for memory intensive applications (Abstract)

Khaled Z. Ibrahim , Lawrence Berkeley National Laboratory, Berkeley
Steven Hofmeyr , Lawrence Berkeley National Laboratory, Berkeley
Costin Iancu , Lawrence Berkeley National Laboratory, Berkeley
Eric Roman , Lawrence Berkeley National Laboratory, Berkeley
pp. 1-11

Scalable hashing for shared memory supercomputers (Abstract)

Eric Goodman , Sandia National Laboratories, Albuquerque, NM
M. Nicole Lemaster , Sandia National Laboratories, Livermore, CA
Edward Jimenez , Sandia National Laboratories, Albuquerque, NM
pp. 1-11

An early performance analysis of POWER7-IH HPC systems (Abstract)

Kevin J. Barker , Performance and Architecture Lab, Pacific Northwest National Laboratory, Richland, WA
Adolfy Hoisie , Performance and Architecture Lab, Pacific Northwest National Laboratory, Richland, WA
Darren J. Kerbyson , Performance and Architecture Lab, Pacific Northwest National Laboratory, Richland, WA
pp. 1-11

A similarity measure for time, frequency, and dependencies in large-scale workloads (Abstract)

Mario Lassnig , University of Innsbruck, Innsbruck, Austria
Thomas Fahringer , University of Innsbruck, Innsbruck, Austria
Vincent Garonne , European Organisation for Nuclear Research, Geneva, Switzerland
Angelos Molfetas , European Organisation for Nuclear Research, Geneva, Switzerland
Martin Barisits , European Organisation for Nuclear Research, Geneva, Switzerland
pp. 1-11

Evaluating the viability of process replication reliability for exascale systems (Abstract)

Kurt Ferreira , Sandia National Laboratories
Jon Stearley , Sandia National Laboratories
James H. Laros , Sandia National Laboratories
Ron Oldfield , Sandia National Laboratories
Kevin Pedretti , Sandia National Laboratories
Ron Brightwell , Sandia National Laboratories
Rolf Riesen , IBM Research, Ireland
Patrick G. Bridges , University of New Mexico
Dorian Arnold , University of New Mexico
pp. 1-12

Modeling and tolerating heterogeneous failures in large parallel systems (Abstract)

Eric Heien , INRIA, France
Derrick Kondo , INRIA, France
Ana Gainaru , University Politehnica of Bucharest
Dan LaPine , UIUC, NCSA, Urbana, IL
Bill Kramer , NCSA, UIUC, Urbana, IL
Franck Cappello , INRIA, France, UIUC, Urbana, IL
pp. 1-11

System implications of memory reliability in exascale computing (Abstract)

Sheng Li , Hewlett-Packard Labs
Ke Chen , University of Notre Dame and Hewlett-Packard Labs
Ming-Yu Hsieh , Sandia National Labs
Naveen Muralimanohar , Hewlett-Packard Labs
Chad D. Kersey , Georgia Institute of Technology
Jay B. Brockman , University of Notre Dame
Arun F. Rodrigues , Sandia National Labs
Norman P. Jouppi , Hewlett-Packard Labs
pp. 1-12

Flexible resource allocation for reliable virtual cluster computing systems (Abstract)

Thomas J. Hacker , Purdue University, West Lafayette, IN
Kanak Mahadik , Purdue University, West Lafayette, IN
pp. 1-12

Auto-scaling to minimize cost and meet application deadlines in cloud workflows (Abstract)

Ming Mao , University of Virginia, Charlottesville, VA
Marty Humphrey , University of Virginia, Charlottesville, VA
pp. 1-12

Large scale debugging of parallel tasks with AutomaDeD (Abstract)

Ignacio Laguna , Purdue University, West Lafayette, IN
Todd Gamblin , Lawrence Livermore National Laboratory, Computation Directorate, Livermore, CA
Bronis R. de Supinski , Lawrence Livermore National Laboratory, Computation Directorate, Livermore, CA
Saurabh Bagchi , Purdue University, West Lafayette, IN
Greg Bronevetsky , Lawrence Livermore National Laboratory, Computation Directorate, Livermore, CA
Dong H. Anh , Lawrence Livermore National Laboratory, Computation Directorate, Livermore, CA
Martin Schulz , Lawrence Livermore National Laboratory, Computation Directorate, Livermore, CA
Barry Rountree , Lawrence Livermore National Laboratory, Computation Directorate, Livermore, CA
pp. 1-10

Efficient data race detection for distributed memory parallel programs (Abstract)

Chang-Seo Park , University of California, Berkeley
Koushik Sen , University of California, Berkeley
Paul Hargrove , Lawrence Berkeley National Laboratory
Costin Iancu , Lawrence Berkeley National Laboratory
pp. 1-12

Sniper: exploring the level of abstraction for scalable and accurate parallel multi-core simulation (Abstract)

Trevor E. Carlson , Ghent University, Belgium and Intel ExaScience Lab, Leuven, Belgium
Wim Heirman , Ghent University, Belgium and Intel ExaScience Lab, Leuven, Belgium
Lieven Eeckhout , Ghent University, Belgium
pp. 1-12

Performance of the community earth system model (Abstract)

Patrick H. Worley , Oak Ridge National Laboratory, Oak Ridge, TN
Arthur A. Mirin , Lawrence Livermore National Laboratory, Livermore, CA
Anthony P. Craig , National Center for Atmospheric Research, CO
Mark A. Taylor , Sandia National Laboratories, Albuquerque, NM
John M. Dennis , National Center for Atmospheric Research, CO
Mariana Vertenstein , National Center for Atmospheric Research, CO
pp. 1-11

Extracting ultra-scale Lattice Boltzmann performance via hierarchical and distributed auto-tuning (Abstract)

Samuel Williams , Lawrence Berkeley National Laboratory
Leonid Oliker , Lawrence Berkeley National Laboratory
Jonathan Carter , Lawrence Berkeley National Laboratory
John Shalf , Lawrence Berkeley National Laboratory
pp. 1-12

Highly scalable ab initio genomic motif identification (Abstract)

Benoît Marchand , King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
Vladimir B. Bajic , King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
Dinesh K. Kaushik , King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
pp. 1-10

Hadoop acceleration through network levitated merge (Abstract)

Yandong Wang , Auburn University
Xinyu Que , Auburn University
Weikuan Yu , Auburn University
Dror Goldenberg , Mellanox Technologies
Dhiraj Sehgal , Mellanox Technologies
pp. 1-10

Purlieus: locality-aware resource allocation for MapReduce in a cloud (Abstract)

Balaji Palanisamy , College of Computing Georgia Tech
Aameek Singh , IBM Research - Almaden
Ling Liu , College of Computing Georgia Tech
Bhushan Jain , IBM India Software Lab
pp. 1-11

A distributed look-up architecture for text mining applications using MapReduce (Abstract)

Atilla Soner Balkir , University of Chicago, Chicago, IL
Ian Foster , University of Chicago, Chicago, IL
Andrey Rzhetsky , University of Chicago, Chicago, IL
pp. 1-11

Copernicus: a new paradigm for parallel adaptive molecular dynamics (Abstract)

Sander Pronk , Royal Institute of Technology, Stockholm, Sweden
Per Larsson , University of Virginia, Charlottesville, VA
Iman Pouya , Royal Institute of Technology, Stockholm, Sweden
Gregory R. Bowman , Stanford University, Stanford, CA
Imran S. Haque , Stanford University, Stanford, CA
Kyle Beauchamp , Stanford University, Stanford, CA
Berk Hess , Royal Institute of Technology, Stockholm, Sweden
Vijay S. Pande , Stanford University, Stanford, CA
Peter M. Kasson , University of Virginia, Charlottesville, VA
Erik Lindahl , Royal Institute of Technology, Stockholm, Sweden
pp. 1-10

Enabling and scaling biomolecular simulations of 100 million atoms on petascale machines with a multicore-optimized message-driven runtime (Abstract)

Chao Mei , University of Illinois at Urbana-Champaign, Urbana, IL
Yanhua Sun , University of Illinois at Urbana-Champaign, Urbana, IL
Gengbin Zheng , University of Illinois at Urbana-Champaign, Urbana, IL
Eric J. Bohm , University of Illinois at Urbana-Champaign, Urbana, IL
Laxmikant V. Kale , University of Illinois at Urbana-Champaign, Urbana, IL
James C. Phillips , University of Illinois at Urbana-Champaign, Urbana, IL
Chris Harrison , University of Illinois at Urbana-Champaign, Urbana, IL
pp. 1-11

Parallelization design on multi-core platforms in density matrix renormalization group toward 2-D quantum strongly-correlated systems (Abstract)

Susumu Yamada , Japan Atomic Energy Agency, Kashiwanoha, Kashiwa-shi, Chiba, Japan
Toshiyuki Imamura , The University of Electro-Communications, Chofugaoka, Chofu-shi, Tokyo, Japan
Masahiko Machida , Japan Atomic Energy Agency, Kashiwanoha, Kashiwa-shi, Chiba, Japan
pp. 1-10

A scalable eigensolver for large scale-free graphs using 2D graph partitioning (Abstract)

Andy Yoo , Center for Applied Scientific Computing, Lawrence Livermore National Laboratory
Allison H. Baker , Center for Applied Scientific Computing, Lawrence Livermore National Laboratory
Roger Pearce , Texas A&M University
Van Emden Henson , Center for Applied Scientific Computing, Lawrence Livermore National Laboratory
pp. 1-11

Scalable stochastic optimization of complex energy systems (Abstract)

Miles Lubin , Argonne National Laboratory, Argonne, IL
Cosmin G. Petra , Argonne National Laboratory, Argonne, IL
Mihai Anitescu , Argonne National Laboratory, Argonne, IL
Victor Zavala , Argonne National Laboratory, Argonne, IL
pp. 1-64

Parallel breadth-first search on distributed memory systems (Abstract)

Aydin Buluç , Lawrence Berkeley National Laboratory, Berkeley, CA
Kamesh Madduri , Lawrence Berkeley National Laboratory, Berkeley, CA
pp. 1-12

SciHadoop: array-based query processing in Hadoop (Abstract)

Joe B. Buck , UC Santa Cruz
Noah Watkins , UC Santa Cruz
Jeff LeFevre , UC Santa Cruz
Kleoni Ioannidou , UC Santa Cruz
Carlos Maltzahn , UC Santa Cruz
Neoklis Polyzotis , UC Santa Cruz
Scott Brandt , UC Santa Cruz
pp. 1-11

On the duality of data-intensive file system design: reconciling HDFS and PVFS (Abstract)

Wittawat Tantisiriroj , Carnegie Mellon University
Seung Woo Son , Carnegie Mellon University
Swapnil Patil , Carnegie Mellon University
Samuel J. Lang , Argonne National Laboratory
Garth Gibson , Argonne National Laboratory
Robert B. Ross , Argonne National Laboratory
pp. 1-12

End-to-end network QoS via scheduling of flexible resource reservation requests (Abstract)

Sushant Sharma , Computational Science Center, Brookhaven National Laboratory, Upton, NY
Dimitrios Katramatos , Computational Science Center, Brookhaven National Laboratory, Upton, NY
Dantong Yu , Computational Science Center, Brookhaven National Laboratory, Upton, NY
pp. 1-10

High-performance lattice QCD for multi-core based parallel systems using a cache-friendly hybrid threaded-MPI approach (Abstract)

Mikhail Smelyanskiy , Parallel Computing Labs, Intel
Karthikeyan Vaidyanathan , Parallel Computing Labs, Intel
Jee Choi , Georgia Institute of Technology
Bálint Joó , Thomas Jefferson National Accelerator
Jatin Chhugani , Parallel Computing Labs, Intel
Michael A. Clark , Harvard-Smithsonian Center for Astrophysics
Pradeep Dubey , Parallel Computing Labs, Intel
pp. 1-11

Scaling lattice QCD beyond 100 GPUs (Abstract)

R. Babich , Boston University, Boston, MA
M. A. Clark , Harvard-Smithsonian Center for Astrophysics, Cambridge, MA
B. Joó , Thomas Jefferson National, Newport News, VA
G. Shi , University of Illinois, Urbana, IL
R. C. Brower , Boston University, Boston, MA
S. Gottlieb , Indiana University, Bloomington, IN
pp. 1-11

Large scale plane wave pseudopotential density functional theory calculations on GPU clusters (Abstract)

Long Wang , Supercomputing Center of Computer, Network Information Center, Chinese Academy of Sciences, ZhongGuanCun, Beijing, China
Yue Wu , Fudan University, Shanghai, China
Weile Jia , Supercomputing Center of Computer, Network Information Center, Chinese Academy of Sciences, ZhongGuanCun, Beijing, China
Weiguo Gao , Fudan University, Shanghai, China
Xuebin Chi , Supercomputing Center of Computer, Network Information Center, Chinese Academy of Sciences, ZhongGuanCun, Beijing, China
Lin-Wang Wang , Lawrence, Berkeley National Laboratory, Berkeley, CA
pp. 1-10

Scalable implementations of accurate excited-state coupled cluster theories: application of high-level methods to porphyrin-based systems (Abstract)

Karol Kowalski , Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, WA
Sriram Krishnamoorthy , Pacific Northwest National Laboratory, Richland, WA
Ryan M. Olson , Cray, Incorporated, MN
Vinod Tipparaju , Oak Ridge National Laboratory, Oak Ridge, TN
E. Aprà , Oak Ridge National Laboratory, Oak Ridge, TN
pp. 1-10

Hardware/software co-design for energy-efficient seismic modeling (Abstract)

Jens Krueger , Lawrence Berkeley National Laboratory, Berkeley, CA and Fraunhofer ITWM, Kaiserslautern, Germany
David Donofrio , Lawrence Berkeley National Laboratory, Berkeley, CA
John Shalf , Lawrence Berkeley National Laboratory, Berkeley, CA
Marghoob Mohiyuddin , Lawrence Berkeley National Laboratory, Berkeley, CA and University of California at Berkeley, Berkeley, CA
Samuel Williams , Lawrence Berkeley National Laboratory, Berkeley, CA
Leonid Oliker , Lawrence Berkeley National Laboratory, Berkeley, CA
Franz-Josef Pfreund , Fraunhofer ITWM, Kaiserslautern, Germany
pp. 1-12

Optimizing the Barnes-Hut algorithm in UPC (Abstract)

Junchao Zhang , University of Illinois at Urbana-Champaign
Babak Behzad , University of Illinois at Urbana-Champaign
Marc Snir , University of Illinois at Urbana-Champaign
pp. 1-11

Avoiding hot-spots on two-level direct networks (Abstract)

Abhinav Bhatele , Lawrence Livermore National Laboratory, Livermore, CA
Nikhil Jain , University of Illinois at Urbana-Champaign, Urbana, IL
William D. Gropp , University of Illinois at Urbana-Champaign, Urbana, IL
Laxmikant V. Kale , University of Illinois at Urbana-Champaign, Urbana, IL
pp. 1-11

Improving communication performance in dense linear algebra via topology aware collectives (Abstract)

Edgar Solomonik , University of California at Berkeley, Berkeley, CA
Abhinav Bhatele , Center for Applied Scientific Computing, Lawrence Livermore National Laboratory, Livermore, CA
James Demmel , University of California at Berkeley, Berkeley, CA
pp. 1-11

Multithreaded Global Address Space Communication Techniques for Gyrokinetic Fusion Applications on Ultra-Scale Platforms (Abstract)

Robert Preissl , Lawrence Berkeley National Laboratory Berkeley, CA
Nathan Wichmann , CRAY Inc. St. Paul, MN
Bill Long , CRAY Inc. St. Paul, MN
John Shalf , Lawrence Berkeley National Laboratory Berkeley, CA
Stephane Ethier , Princeton Plasma Physics Laboratory Princeton, NJ
Alice Koniges , Lawrence Berkeley National Laboratory Berkeley, CA
pp. 1-11

Deep and wide metrics for HPC resource capability and project usage (Abstract)

David Hart , Nat. Center for Atmos. Res., Boulder, CO, USA
pp. 1-7

Challenges in the management of high-performance computing centers: An organizational perspective (Abstract)

Nicholas Berente , Terry Coll. of Bus., Univ. of Georgia, Athens, GA, USA
Jennifer Claggett , Terry Coll. of Bus., Univ. of Georgia, Athens, GA, USA
pp. 1-8

Integrating multi-touch in high-resolution display environments (Abstract)

Brandt Westing , Texas Adv. Comput. Center, Austin, TX, USA
Benjamin Urick , Texas Adv. Comput. Center, Austin, TX, USA
Maria Esteva , Texas Adv. Comput. Center, Austin, TX, USA
Freddy Rojas , Texas Adv. Comput. Center, Austin, TX, USA
Weijia Xu , Texas Adv. Comput. Center, Austin, TX, USA
pp. 1-9

How to measure useful, sustained performance (Abstract)

William Kramer , Nat. Center for Supercomput. Applic., Univ. of Illinois, Urbana, IL, USA
pp. 1-18

Performance modeling for systematic performance tuning (Abstract)

Torsten Hoefler , Nat. Center for Supercomput. Applic., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
William Gropp , Nat. Center for Supercomput. Applic., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
Marc Snir , Nat. Center for Supercomput. Applic., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
William Kramer , Nat. Center for Supercomput. Applic., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
pp. 1-12

A long-distance infiniband interconnection between two clusters in production use (Abstract)

Sabine Richling , IT-Center, Univ. of Heidelberg, Heidelberg, Germany
Steffen Hau , IT-Center, Univ. of Mannheim, Mannheim, Germany
Heinz Kredel , IT-Center, Univ. of Mannheim, Mannheim, Germany
Hans-Gunther Kruse , IT-Center, Univ. of Mannheim, Mannheim, Germany
pp. 1-8

SPOTlight on testing: Stability, performance and operational testing of LANL HPC clusters (Abstract)

Georgia Pedicini , High Performance Comput. Syst., Los Alamos Nat. Lab., Los Alamos, NM, USA
Jennifer Green , High Performance Comput. Syst., Los Alamos Nat. Lab., Los Alamos, NM, USA
pp. 1-8

A Toolkit for Event Analysis and Logging (Abstract)

James Carey , IBM, Rochester, MN, USA
Philip Sanders , IBM, Rochester, MN, USA
pp. 1-7

Best practices for the deployment and management of production HPC clusters (Abstract)

Robert McLay , Texas Adv. Comput. Center (TACC), Univ. of Texas at Austin, Austin, TX, USA
Karl W. Schulz , Texas Adv. Comput. Center (TACC), Univ. of Texas at Austin, Austin, TX, USA
William L. Barth , Texas Adv. Comput. Center (TACC), Univ. of Texas at Austin, Austin, TX, USA
Tommy Minyard , Texas Adv. Comput. Center (TACC), Univ. of Texas at Austin, Austin, TX, USA
pp. 1-11

Logjam: A scalable unified log file archiver (Abstract)

Nicholas P. Cardo , Lawrence Berkeley Nat. Lab., Nat. Energy Res. Sci. Comput. Center, Berkeley, CA, USA
pp. 1-9

The NWSC benchmark suite: Using scientific throughput to measure supercomputer performance (Abstract)

Rory C. Kelly , Comput. & Inf. Syst. Lab., Nat. Center for Atmos. Res., Boulder, CO, USA
Siddartha S. Ghosh , Comput. & Inf. Syst. Lab., Nat. Center for Atmos. Res., Boulder, CO, USA
Si Liu , Comput. & Inf. Syst. Lab., Nat. Center for Atmos. Res., Boulder, CO, USA
Davide Del Vento , Comput. & Inf. Syst. Lab., Nat. Center for Atmos. Res., Boulder, CO, USA
Richard A. Valent , Comput. & Inf. Syst. Lab., Nat. Center for Atmos. Res., Boulder, CO, USA
pp. 1-5

Qserv: A distributed shared-nothing database for the LSST catalog (Abstract)

Daniel L. Wang , SLAC Nat. Accel. Lab., Menlo Park, CA, USA
Serge M. Monkewitz , Infrared Process. & Anal. Center, California Inst. of Technol., Pasadena, CA, USA
Kian-Tat Lim , SLAC Nat. Accel. Lab., Menlo Park, CA, USA
Jacek Becla , SLAC Nat. Accel. Lab., Menlo Park, CA, USA
pp. 1-11

Challenges of HPC monitoring (Abstract)

William Allcock , Argonne Nat. Lab., Argonne, IL, USA
Evan Felix , Pacific Northwest Nat. Lab., Richland, WA, USA
Mike Lowe , Indiana Univ., Bloomington, IN, USA
Randal Rheinheimer , Los Alamos Nat. Lab., Los Alamos, NM, USA
Joshi Fullop , Nat. Center for Supercomput. Applic., USA
pp. 1-6

Adaptive simulation of turbulent flow past a full car model (Abstract)

Niclas Jansson , Comput. Technol. Lab., KTH R. Inst. of Tech., Stockholm, Sweden
Johan Hoffman , Comput. Technol. Lab., KTH R. Inst. of Tech., Stockholm, Sweden
Murtazo Nazarov , Comput. Technol. Lab., KTH R. Inst. of Tech., Stockholm, Sweden
pp. 1-8

World-highest resolution global atmospheric model and its performance on the Earth Simulator (Abstract)

Keiko Takahashi , Japan Agency for Marine-Earth Sci. & Technol., Yokosuka, Japan
Ken'ichi Itakura , Japan Agency for Marine-Earth Sci. & Technol., Yokosuka, Japan
Satom Okura , Japan Agency for Marine-Earth Sci. & Technol., Yokosuka, Japan
Kiinihiko Watanabe , Japan Agency for Marine-Earth Sci. & Technol., Yokosuka, Japan
pp. 1-12

A survey of the practice of computational science (Abstract)

Prakash Prabhu , Princeton Univ., Princeton, NJ, USA
Thomas B. Jablin , Princeton Univ., Princeton, NJ, USA
Arun Raman , Princeton Univ., Princeton, NJ, USA
Yun Zhang , Princeton Univ., Princeton, NJ, USA
Jialu Huang , Princeton Univ., Princeton, NJ, USA
Hanjun Kim , Princeton Univ., Princeton, NJ, USA
Nick P. Johnson , Princeton Univ., Princeton, NJ, USA
Feng Liu , Princeton Univ., Princeton, NJ, USA
Soumyadeep Ghosh , Princeton Univ., Princeton, NJ, USA
Stephen Beard , Princeton Univ., Princeton, NJ, USA
Taewook Oh , Princeton Univ., Princeton, NJ, USA
Matthew Zoufaly , Princeton Univ., Princeton, NJ, USA
David Walker , Princeton Univ., Princeton, NJ, USA
David I. August , Princeton Univ., Princeton, NJ, USA
pp. 1-12

Performance evaluations of gyrokinetic Eulerian code GT5D on massively parallel multi-core platforms (Abstract)

Yasuhiro Idomura , Japan Atomic Energy Agency, Tokyo, Japan
Sebastien Jolliet , Japan Atomic Energy Agency, Tokyo, Japan
pp. 1-9

Janus: Co-designing HPC systems and facilities (Abstract)

Henry M. Tufo , Univ. of Colorado, Boulder, Boulder, CO, USA
Michael K. Patterson , Intel Corp., Hillsboro, OR, USA
Michael Oberg , Nat. Center for Atmos. Res., Boulder, CO, USA
Matthew Woitaszek , Nat. Center for Atmos. Res., Boulder, CO, USA
Guy Cobb , Univ. of Colorado, Boulder, Boulder, CO, USA
Robert Strong , Critical Facilities Technol., Arvada, CO, USA
Jim Gutowski , Dell, Inc., Round Rock, TX, USA
pp. 1-9
83 ms
(Ver 3.3 (11022016))