SC Conference (2011)
Seattle, Washington
Nov. 12, 2011 to Nov. 18, 2011
ISBN: 978-1-4503-0771-0
TABLE OF CONTENTS
Papers

SC 2011 Keynote (Abstract)

pp. i

First-principles calculations of electron states of a silicon nanowire with 100,000 atoms on the K computer (Abstract)

Atsushi Oshiyama , The University of Tokyo
Miwako Tsuji , University of Tsukuba
Ikuo Miyoshi , Next Generation Technical Computing Unit, Fujitsu Limited
Mitsuo Yokokawa , Next-Generation Supercomputer R&D Center, Riken
Jun-Ichi Iwata , University of Tsukuba
Fumiyoshi Shoji , Next-Generation Supercomputer R&D Center, Riken
Taisuke Boku , University of Tsukuba
Yukihiro Hasegawa , Next-Generation Supercomputer R&D Center, Riken
Atsuya Uno , Next-Generation Supercomputer R&D Center, Riken
Daisuke Takahashi , University of Tsukuba
Hikaru Inoue , Technical Computing Solution Unit, Fujitsu Limited
Kazuo Minami , Next-Generation Supercomputer R&D Center, Riken
Motoyoshi Kurokawa , Next-Generation Supercomputer R&D Center, Riken
pp. 1-11

Atomistic nanoelectronic device engineering with sustained performances up to 1.44 PFlop/s (Abstract)

Timothy B. Boykin , University of Alabama in Huntsville, Huntsville, AL
Mathieu Luisier , Purdue University, West Lafayette, IN
Wolfgang Fichtner , Integrated Systems Laboratory, ETH Zürich, Zürich, Switzerland
Gerhard Klimeck , Purdue University, West Lafayette, IN
pp. 1-11

Peta-scale phase-field simulation for dendritic solidification on the TSUBAME 2.0 supercomputer (Abstract)

Takashi Shimokawabe , Tokyo Institute of Technology, Ookayama, Meguro-ku, Tokyo, Japan
Tomohiro Takaki , Kyoto Institute of Technology, Gosyokaido-cyo, Matsugasaki, Sakyo-ku, Kyoto, Japan
Toshio Endo , Tokyo Institute of Technology, Meguro-ku, Tokyo, Japan and Japan Science and Technology Agency, CREST
Takayuki Aoki , Tokyo Institute of Technology, Meguro-ku, Tokyo, Japan and Japan Science and Technology Agency, CREST
Naoya Maruyama , Tokyo Institute of Technology, Meguro-ku, Tokyo, Japan and Japan Science and Technology Agency, CREST
Satoshi Matsuoka , Tokyo Institute of Technology, Meguro-ku, Tokyo, Japan and Japan Science and Technology Agency, CREST and National Institute of Informatics
Akinori Yamanaka , Tokyo Institute of Technology, Meguro-ku, Tokyo, Japan
Akira Nukada , Tokyo Institute of Technology, Meguro-ku, Tokyo, Japan and Japan Science and Technology Agency, CREST
pp. 1-11

Petaflop biofluidics simulations on a two million-core system (Abstract)

Toshio Endo , Tokyo Institute of Technology, Tokyo, Japan
Satoshi Matsuoka , Tokyo Institute of Technology, Tokyo, Japan
Massimiliano Fatica , Nvidia Corp., Santa Clara, CA
Massimo Bernaschi , CNR-IAC, Istituto Applicazioni Calcolo, Consiglio Nazionale delle Ricerche, Rome, Italy
Mauro Bisson , Harvard University, Cambridge, MA
Simone Melchionna , CNR-IPCF, Istituto Processi Chimico-Fisici, Consiglio Nazionale delle Ricerche, Rome, Italy
pp. 1-12

A new computational paradigm in multiscale simulations: application to brain blood flow (Abstract)

Joseph A. Insley , Argonne National Laboratory, Argonne, IL
Michael E. Papka , Argonne National Laboratory, Argonne, Illinois
Leopold Grinberg , Brown University, Providence, RI
Vitali Morozov , Argonne National Laboratory, Argonne, IL
Dmitry Fedosov , Institute of Complex Systems, FZ Juelich, Juelich, Germany
Kalyan Kumaran , Argonne National Laboratory, Argonne, Illinois
George Em Karniadakis , Brown University, Providence, RI
pp. 1-5

Optimizing symmetric dense matrix-vector multiplication on GPUs (Abstract)

Rajib Nath , University of California, San Diego
Tingxing "Tim" Dong , University of Tennessee, Knoxville
Jack Dongarra , University of Tennessee, Knoxville
Stanimire Tomov , University of Tennessee, Knoxville
pp. 1-10

Tiled QR factorization algorithms (Abstract)

Julien Langou , University of Colorado Denver
Henricus Bouwmeester , University of Colorado Denver
Yves Robert , ENS Lyon, France
Mathias Jacquelin , ENS Lyon, France
pp. 1-11

Parallel reduction to condensed forms for symmetric eigenvalue problems using aggregated fine-grained and memory-aware kernels (Abstract)

Jack Dongarra , University of Tennessee, Knoxville, TN
Azzam Haidar , University of Tennessee, Knoxville, TN
Hatem Ltaief , KAUST Supercomputing Laboratory, Thuwal, Saudi Arabia
pp. 1-11

Liszt: a domain specific language for building portable mesh-based PDE solvers (Abstract)

Alex Aiken , Stanford University
Zachary DeVito , Stanford University
Montserrat Medina , Stanford University
Frank Ham , Stanford University
Erich Elsen , Stanford University
Niels Joubert , Stanford University
Francisco Palacios , Stanford University
Karthik Duraisamy , Stanford University
Mike Barrientos , Stanford University
Eric Darve , Stanford University
Pat Hanrahan , Stanford University
Juan Alonso , Stanford University
Stephen Oakley , Stanford University
pp. 1-12

Simplified parallel domain traversal (Abstract)

David Erickson , Oak Ridge National Laboratory
Jian Huang , The University of Tennessee, Knoxville
Jingyuan Wang , The University of Tennessee, Knoxville
Wesley Kendall , The University of Tennessee, Knoxville
Melissa Allen , The University of Tennessee, Knoxville
Tom Peterka , Argonne National Laboratory
pp. 1-11

Physis: an implicitly parallel programming model for stencil computations on large-scale GPU-accelerated supercomputers (Abstract)

Kento Sato , Tokyo Institute of Technology, Ookayama, Meguro-ku, Tokyo, Japan
Naoya Maruyama , Tokyo Institute of Technology, Ookayama, Meguro-ku, Tokyo, Japan
Satoshi Matsuoka , Tokyo Institute of Technology, Ookayama, Meguro-ku, Tokyo, Japan
Tatsuo Nomura , Google, Inc., Roppongi, Minato-ku, Tokyo, Japan
pp. 1-12

CudaDMA: optimizing GPU memory bandwidth via warp specialization (Abstract)

Henry Cook , UC Berkeley
Michael Bauer , Stanford University
Brucek Khailany , NVIDIA Research
pp. 1-11

Dymaxion: optimizing memory access patterns for heterogeneous systems (Abstract)

Shuai Che , University of Virginia
Jeremy W. Sheaffer , University of Virginia
Kevin Skadron , University of Virginia
pp. 1-11

GROPHECY: GPU performance projection from CPU code skeletons (Abstract)

Venkatram Vishwanath , Argonne National Laboratory
Vitali A. Morozov , Argonne National Laboratory
Kalyan Kumaran , Argonne National Laboratory
Jiayuan Meng , Argonne National Laboratory
Thomas D. Uram , Argonne National Laboratory
pp. 1-11

Parallel random numbers: as easy as 1, 2, 3 (Abstract)

Mark A. Moraes , D. E. Shaw Research, New York, NY
John K. Salmon , D. E. Shaw Research, New York, NY
Ron O. Dror , D. E. Shaw Research, New York, NY
David E. Shaw , D. E. Shaw Research, New York, NY
pp. 1-12

Server-side I/O coordination for parallel file systems (Abstract)

Samuel Lang , Argonne National Laboratory, Argonne, IL
Xian-He Sun , Illinois Institute of Technology, Chicago, IL
Rajeev Thakur , Argonne National Laboratory, Argonne, IL
Huaiming Song , Illinois Institute of Technology, Chicago, IL
Yanlong Yin , Illinois Institute of Technology, Chicago, IL
pp. 1-11

QoS support for end users of I/O-intensive applications using shared storage systems (Abstract)

Song Jiang , Wayne State University, Detroit, MI
Kei Davis , Los Alamos National Laboratory, Los Alamos, NM
Xuechen Zhang , Wayne State University, Detroit, MI
pp. 1-12

Topology-aware data movement and staging for I/O acceleration on Blue Gene/P supercomputing systems (Abstract)

Venkatram Vishwanath , Argonne National Laboratory, Argonne, IL
Mark Hereld , Argonne National Laboratory, Argonne, IL
Vitali Morozov , Argonne National Laboratory, Argonne, IL
Michael E. Papka , Argonne National Laboratory, Argonne, IL
pp. 1-11

GreenSlot: scheduling energy consumption in green datacenters (Abstract)

Jordi Torres , UPC/BSC
Íñigo Goiri , UPC/BSC and Rutgers Univ.
Ricardo Bianchini , Rutgers University
Jordi Guitart , UPC/BSC
Md. E. Haque , Rutgers University
Ryan Beauchea , Rutgers University
Thu D. Nguyen , Rutgers University
Kien Le , Rutgers University
pp. 1-11

A 'cool' load balancer for parallel applications (Abstract)

Osman Sarood , University of Illinois at Urbana-Champaign, Urbana, IL
Laxmikant V. Kale , University of Illinois at Urbana-Champaign, Urbana, IL
pp. 1-11

Reducing electricity cost through virtual machine placement in high performance computing clouds (Abstract)

Jingru Zhang , Rutgers University
Ricardo Bianchini , Rutgers University
Kien Le , Rutgers University
Yogesh Jaluria , Rutgers University
Thu D. Nguyen , Rutgers University
Jiandong Meng , Rutgers University
pp. 1-12

Gyrokinetic toroidal simulations on leading multi- and manycore HPC systems (Abstract)

John Shalf , NERSC/CRD, Lawrence Berkeley National Laboratory, Berkeley
Samuel Williams , NERSC/CRD, Lawrence Berkeley National Laboratory, Berkeley
Stephane Ethier , Princeton Plasma Physics Laboratory, Princeton
Eun-Jin Im , Kookmin University, Seoul, Korea
Leonid Oliker , NERSC/CRD, Lawrence Berkeley National Laboratory, Berkeley
Khaled Z. Ibrahim , NERSC/CRD, Lawrence Berkeley National Laboratory, Berkeley
Kamesh Madduri , NERSC/CRD, Lawrence Berkeley National Laboratory, Berkeley
pp. 1-12

Unitary qubit lattice simulations of multiscale phenomena in quantum turbulence (Abstract)

Jeffrey Yepez , Air Force Research Laboratories, Hanscom AFB, MA
George Vahala , William & Mary, Williamsburg, VA
Bo Zhang , William & Mary, Williamsburg, VA
Linda Vahala , Old Dominion University, Norfolk, VA
Sean Ziegeler , High Performance Technologies, Inc., Reston, VA
Jonathan Carter , Computing Sciences, Lawrence Berkeley National Laboratory, Berkeley, CA
Min Soe , Rogers State University, Claremore, OK
pp. 1-11

An image compositing solution at scale (Abstract)

Tom Peterka , Argonne National Laboratory
Jian Huang , University of Tennessee, Knoxville
Kenneth Moreland , Sandia National Laboratories
Wesley Kendall , University of Tennessee, Knoxville
pp. 1-10

High-efficiency server design (Abstract)

Pierluigi Sarti , Facebook
Avery Nisbet , Facebook
Amir Michael , Facebook
Jacob Na , Facebook
Ali Heydari , Facebook
Harry Li , Facebook
pp. 1-27

Using the TOP500 to trace and project technology and architecture trends (Abstract)

Timothy J. Dysart , Univ. of Notre Dame, Fitzpatrick Hall, Notre Dame, IN
Peter M. Kogge , Univ. of Notre Dame, Fitzpatrick Hall, Notre Dame, IN
pp. 1-11

I/O streaming evaluation of batch queries for data-intensive computational turbulence (Abstract)

Eric Perlman , Johns Hopkins University, Baltimore, Maryland
Randal Burns , Johns Hopkins University, Baltimore, Maryland
Kalin Kanov , Johns Hopkins University, Baltimore, Maryland
Yanif Ahmad , Johns Hopkins University, Baltimore, Maryland
Alexander Szalay , Johns Hopkins University, Baltimore, Maryland
pp. 1-10

ISABELA-QA: query-driven analytics with ISABELA-compressed extreme-scale scientific data (Abstract)

Hemanth Kolla , Sandia National Laboratory, Livermore, CA
Sriram Lakshminarasimhan , North Carolina State University, NC and Oak Ridge National Laboratory, Oak Ridge, TN
Seung-Hoe Ku , New York University, New York, NY
Zhenhuan Gong , North Carolina State University, NC
Scott Klasky , Oak Ridge National Laboratory, Oak Ridge, TN
John Jenkins , North Carolina State University, NC and Oak Ridge National Laboratory, Oak Ridge, TN
Nagiza F. Samatova , North Carolina State University, NC and Oak Ridge National Laboratory, Oak Ridge, TN
Jackie Chen , Sandia National Laboratory, Livermore, CA
Robert Ross , Argonne National Laboratory, Argonne, IL
Robert Latham , Argonne National Laboratory, Argonne, IL
Isha Arkatkar , North Carolina State University, NC and Oak Ridge National Laboratory, Oak Ridge, TN
C. S. Chang , New York University, New York, NY
Stephane Ethier , Princeton Plasma Physics Laboratory, Princeton, NJ
pp. 1-11

FTI: high performance fault tolerance interface for hybrid systems (Abstract)

Naoya Maruyama , Tokyo Institute of Technology
Satoshi Matsuoka , Tokyo Institute of Technology
Dimitri Komatitsch , University of Toulouse
Seiji Tsuboi , JAMSTEC
Leonardo Bautista-Gomez , Tokyo Institute of Technology, INRIA
Franck Cappello , INRIA, University of Illinois
pp. 1-32

Checkpointing strategies for parallel jobs (Abstract)

Marin Bougeret , ENS Lyon, France
Henri Casanova , Univ. of Hawai'i at Manoa, Honolulu
Yves Robert , ENS Lyon, France
Mikael Rabie , ENS Lyon, France
Frédéric Vivien , INRIA, Lyon, France
pp. 1-11

BlobCR: efficient checkpoint-restart for HPC applications on IaaS clouds using virtual disk image snapshots (Abstract)

Bogdan Nicolae , INRIA Saclay, Île-de-France, France
Franck Cappello , University of Illinois at Urbana Champaign
pp. 1-12

Fast implementation of DGEMM on Fermi GPU (Abstract)

Ninghui Sun , Key Laboratory of Computer Architecture, Institute of Computing Technology
Yungang Bao , Key Laboratory of Computer Architecture, Institute of Computing Technology
Guangming Tan , Key Laboratory of Computer Architecture, Institute of Computing Technology
Everett Phillips , Nvidia Corporation
Linchuan Li , Key Laboratory of Computer Architecture, Institute of Computing Technology
Sean Triechle , Nvidia Corporation
pp. 1-11

Scalable fast multipole methods on distributed heterogeneous architectures (Abstract)

Qi Hu , University of Maryland, College Park
Ramani Duraiswami , University of Maryland, College Park
Nail A. Gumerov , University of Maryland, College Park
pp. 1-12

Multi-science applications with single codebase - GAMER - for massively parallel architectures (Abstract)

Tak-Pong Woo , Soochow University, Taipei, Taiwan
Tzihong Chiueh , National Taiwan University, Taipei, Taiwan
Hsi-Yu Schive , National Taiwan University, Taipei, Taiwan
Hemant Shukla , Lawrence Berkeley National Laboratory, Berkeley
pp. 1-11

Virtual I/O caching: dynamic storage cache management for concurrent workloads (Abstract)

Mahmut Kandemir , Pennsylvania State University, University Park, Pennsylvania
Michael Frasca , Pennsylvania State University, University Park, Pennsylvania
Padma Raghavan , Pennsylvania State University, University Park, Pennsylvania
Ramya Prabhakar , Pennsylvania State University, University Park, Pennsylvania
pp. 1-11

SCMFS: a file system for storage class memory (Abstract)

Xiaojian Wu , Texas A&M University
A. L. Narasimha Reddy , Texas A&M University
pp. 1-11

Optimized pre-copy live migration for memory intensive applications (Abstract)

Costin Iancu , Lawrence Berkeley National Laboratory, Berkeley
Khaled Z. Ibrahim , Lawrence Berkeley National Laboratory, Berkeley
Steven Hofmeyr , Lawrence Berkeley National Laboratory, Berkeley
Eric Roman , Lawrence Berkeley National Laboratory, Berkeley
pp. 1-11

Scalable hashing for shared memory supercomputers (Abstract)

Eric Goodman , Sandia National Laboratories, Albuquerque, NM
Edward Jimenez , Sandia National Laboratories, Albuquerque, NM
M. Nicole Lemaster , Sandia National Laboratories, Livermore, CA
pp. 1-11

An early performance analysis of POWER7-IH HPC systems (Abstract)

Adolfy Hoisie , Performance and Architecture Lab, Pacific Northwest National Laboratory, Richland, WA
Darren J. Kerbyson , Performance and Architecture Lab, Pacific Northwest National Laboratory, Richland, WA
Kevin J. Barker , Performance and Architecture Lab, Pacific Northwest National Laboratory, Richland, WA
pp. 1-11

A similarity measure for time, frequency, and dependencies in large-scale workloads (Abstract)

Angelos Molfetas , European Organisation for Nuclear Research, Geneva, Switzerland
Martin Barisits , European Organisation for Nuclear Research, Geneva, Switzerland
Thomas Fahringer , University of Innsbruck, Innsbruck, Austria
Mario Lassnig , University of Innsbruck, Innsbruck, Austria
Vincent Garonne , European Organisation for Nuclear Research, Geneva, Switzerland
pp. 1-11

Evaluating the viability of process replication reliability for exascale systems (Abstract)

Ron Brightwell , Sandia National Laboratories
Kevin Pedretti , Sandia National Laboratories
Dorian Arnold , University of New Mexico
Patrick G. Bridges , University of New Mexico
James H. Laros , Sandia National Laboratories
Ron Oldfield , Sandia National Laboratories
Jon Stearley , Sandia National Laboratories
Rolf Riesen , IBM Research, Ireland
Kurt Ferreira , Sandia National Laboratories
pp. 1-12

Modeling and tolerating heterogeneous failures in large parallel systems (Abstract)

Ana Gainaru , University Politehnica of Bucharest
Bill Kramer , NCSA, UIUC, Urbana, IL
Derrick Kondo , INRIA, France
Eric Heien , INRIA, France
Dan LaPine , UIUC, NCSA, Urbana, IL
Franck Cappello , INRIA, France, UIUC, Urbana, IL
pp. 1-11

System implications of memory reliability in exascale computing (Abstract)

Sheng Li , Hewlett-Packard Labs
Ke Chen , University of Notre Dame and Hewlett-Packard Labs
Arun F. Rodrigues , Sandia National Labs
Naveen Muralimanohar , Hewlett-Packard Labs
Chad D. Kersey , Georgia Institute of Technology
Ming-Yu Hsieh , Sandia National Labs
Norman P. Jouppi , Hewlett-Packard Labs
Jay B. Brockman , University of Notre Dame
pp. 1-12

Flexible resource allocation for reliable virtual cluster computing systems (Abstract)

Kanak Mahadik , Purdue University, West Lafayette, IN
Thomas J. Hacker , Purdue University, West Lafayette, IN
pp. 1-12

Auto-scaling to minimize cost and meet application deadlines in cloud workflows (Abstract)

Marty Humphrey , University of Virginia, Charlottesville, VA
Ming Mao , University of Virginia, Charlottesville, VA
pp. 1-12

Large scale debugging of parallel tasks with AutomaDeD (Abstract)

Barry Rountree , Lawrence Livermore National Laboratory, Computation Directorate, Livermore, CA
Martin Schulz , Lawrence Livermore National Laboratory, Computation Directorate, Livermore, CA
Greg Bronevetsky , Lawrence Livermore National Laboratory, Computation Directorate, Livermore, CA
Ignacio Laguna , Purdue University, West Lafayette, IN
Saurabh Bagchi , Purdue University, West Lafayette, IN
Todd Gamblin , Lawrence Livermore National Laboratory, Computation Directorate, Livermore, CA
Bronis R. de Supinski , Lawrence Livermore National Laboratory, Computation Directorate, Livermore, CA
Dong H. Ahn , Lawrence Livermore National Laboratory, Computation Directorate, Livermore, CA
pp. 1-10

Efficient data race detection for distributed memory parallel programs (Abstract)

Paul Hargrove , Lawrence Berkeley National Laboratory
Costin Iancu , Lawrence Berkeley National Laboratory
Chang-Seo Park , University of California, Berkeley
Koushik Sen , University of California, Berkeley
pp. 1-12

Sniper: exploring the level of abstraction for scalable and accurate parallel multi-core simulation (Abstract)

Lieven Eeckhout , Ghent University, Belgium
Wim Heirman , Ghent University, Belgium and Intel ExaScience Lab, Leuven, Belgium
Trevor E. Carlson , Ghent University, Belgium and Intel ExaScience Lab, Leuven, Belgium
pp. 1-12

Performance of the community earth system model (Abstract)

Arthur A. Mirin , Lawrence Livermore National Laboratory, Livermore, CA
John M. Dennis , National Center for Atmospheric Research, CO
Anthony P. Craig , National Center for Atmospheric Research, CO
Mark A. Taylor , Sandia National Laboratories, Albuquerque, NM
Patrick H. Worley , Oak Ridge National Laboratory, Oak Ridge, TN
Mariana Vertenstein , National Center for Atmospheric Research, CO
pp. 1-11

Extracting ultra-scale Lattice Boltzmann performance via hierarchical and distributed auto-tuning (Abstract)

Jonathan Carter , Lawrence Berkeley National Laboratory
John Shalf , Lawrence Berkeley National Laboratory
Leonid Oliker , Lawrence Berkeley National Laboratory
Samuel Williams , Lawrence Berkeley National Laboratory
pp. 1-12

Highly scalable ab initio genomic motif identification (Abstract)

Benoît Marchand , King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
Vladimir B. Bajic , King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
Dinesh K. Kaushik , King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
pp. 1-10

Hadoop acceleration through network levitated merge (Abstract)

Yandong Wang , Auburn University
Dror Goldenberg , Mellanox Technologies
Dhiraj Sehgal , Mellanox Technologies
Weikuan Yu , Auburn University
Xinyu Que , Auburn University
pp. 1-10

Purlieus: locality-aware resource allocation for MapReduce in a cloud (Abstract)

Ling Liu , College of Computing, Georgia Tech
Aameek Singh , IBM Research - Almaden
Balaji Palanisamy , College of Computing, Georgia Tech
Bhushan Jain , IBM India Software Lab
pp. 1-11

A distributed look-up architecture for text mining applications using MapReduce (Abstract)

Ian Foster , University of Chicago, Chicago, IL
Atilla Soner Balkir , University of Chicago, Chicago, IL
Andrey Rzhetsky , University of Chicago, Chicago, IL
pp. 1-11

Copernicus: a new paradigm for parallel adaptive molecular dynamics (Abstract)

Berk Hess , Royal Institute of Technology, Stockholm, Sweden
Per Larsson , University of Virginia, Charlottesville, VA
Erik Lindahl , Royal Institute of Technology, Stockholm, Sweden
Sander Pronk , Royal Institute of Technology, Stockholm, Sweden
Imran S. Haque , Stanford University, Stanford, CA
Kyle Beauchamp , Stanford University, Stanford, CA
Vijay S. Pande , Stanford University, Stanford, CA
Gregory R. Bowman , Stanford University, Stanford, CA
Iman Pouya , Royal Institute of Technology, Stockholm, Sweden
Peter M. Kasson , University of Virginia, Charlottesville, VA
pp. 1-10

Enabling and scaling biomolecular simulations of 100 million atoms on petascale machines with a multicore-optimized message-driven runtime (Abstract)

Laxmikant V. Kale , University of Illinois at Urbana-Champaign, Urbana, IL
Chris Harrison , University of Illinois at Urbana-Champaign, Urbana, IL
James C. Phillips , University of Illinois at Urbana-Champaign, Urbana, IL
Chao Mei , University of Illinois at Urbana-Champaign, Urbana, IL
Eric J. Bohm , University of Illinois at Urbana-Champaign, Urbana, IL
Yanhua Sun , University of Illinois at Urbana-Champaign, Urbana, IL
Gengbin Zheng , University of Illinois at Urbana-Champaign, Urbana, IL
pp. 1-11

Parallelization design on multi-core platforms in density matrix renormalization group toward 2-D quantum strongly-correlated systems (Abstract)

Masahiko Machida , Japan Atomic Energy Agency, Kashiwanoha, Kashiwa-shi, Chiba, Japan
Toshiyuki Imamura , The University of Electro-Communications, Chofugaoka, Chofu-shi, Tokyo, Japan
Susumu Yamada , Japan Atomic Energy Agency, Kashiwanoha, Kashiwa-shi, Chiba, Japan
pp. 1-10

A scalable eigensolver for large scale-free graphs using 2D graph partitioning (Abstract)

Andy Yoo , Center for Applied Scientific Computing, Lawrence Livermore National Laboratory
Allison H. Baker , Center for Applied Scientific Computing, Lawrence Livermore National Laboratory
Van Emden Henson , Center for Applied Scientific Computing, Lawrence Livermore National Laboratory
Roger Pearce , Texas A&M University
pp. 1-11

Scalable stochastic optimization of complex energy systems (Abstract)

Miles Lubin , Argonne National Laboratory, Argonne, IL
Victor Zavala , Argonne National Laboratory, Argonne, IL
Mihai Anitescu , Argonne National Laboratory, Argonne, IL
Cosmin G. Petra , Argonne National Laboratory, Argonne, IL
pp. 1-64

Parallel breadth-first search on distributed memory systems (Abstract)

Kamesh Madduri , Lawrence Berkeley National Laboratory, Berkeley, CA
Aydin Buluç , Lawrence Berkeley National Laboratory, Berkeley, CA
pp. 1-12

SciHadoop: array-based query processing in Hadoop (Abstract)

Joe B. Buck , UC Santa Cruz
Jeff LeFevre , UC Santa Cruz
Neoklis Polyzotis , UC Santa Cruz
Kleoni Ioannidou , UC Santa Cruz
Scott Brandt , UC Santa Cruz
Noah Watkins , UC Santa Cruz
Carlos Maltzahn , UC Santa Cruz
pp. 1-11

On the duality of data-intensive file system design: reconciling HDFS and PVFS (Abstract)

Wittawat Tantisiriroj , Carnegie Mellon University
Robert B. Ross , Argonne National Laboratory
Swapnil Patil , Carnegie Mellon University
Samuel J. Lang , Argonne National Laboratory
Seung Woo Son , Argonne National Laboratory
Garth Gibson , Carnegie Mellon University
pp. 1-12

End-to-end network QoS via scheduling of flexible resource reservation requests (Abstract)

Dimitrios Katramatos , Computational Science Center, Brookhaven National Laboratory, Upton, NY
Dantong Yu , Computational Science Center, Brookhaven National Laboratory, Upton, NY
Sushant Sharma , Computational Science Center, Brookhaven National Laboratory, Upton, NY
pp. 1-10

High-performance lattice QCD for multi-core based parallel systems using a cache-friendly hybrid threaded-MPI approach (Abstract)

Michael A. Clark , Harvard-Smithsonian Center for Astrophysics
Mikhail Smelyanskiy , Parallel Computing Labs, Intel
Pradeep Dubey , Parallel Computing Labs, Intel
Jee Choi , Georgia Institute of Technology
Bálint Joó , Thomas Jefferson National Accelerator
Karthikeyan Vaidyanathan , Parallel Computing Labs, Intel
Jatin Chhugani , Parallel Computing Labs, Intel
pp. 1-11

Scaling lattice QCD beyond 100 GPUs (Abstract)

S. Gottlieb , Indiana University, Bloomington, IN
M. A. Clark , Harvard-Smithsonian Center for Astrophysics, Cambridge, MA
B. Joó , Thomas Jefferson National, Newport News, VA
G. Shi , University of Illinois, Urbana, IL
R. C. Brower , Boston University, Boston, MA
R. Babich , Boston University, Boston, MA
pp. 1-11

Large scale plane wave pseudopotential density functional theory calculations on GPU clusters (Abstract)

Weile Jia , Supercomputing Center, Computer Network Information Center, Chinese Academy of Sciences, ZhongGuanCun, Beijing, China
Lin-Wang Wang , Lawrence Berkeley National Laboratory, Berkeley, CA
Long Wang , Supercomputing Center, Computer Network Information Center, Chinese Academy of Sciences, ZhongGuanCun, Beijing, China
Yue Wu , Fudan University, Shanghai, China
Weiguo Gao , Fudan University, Shanghai, China
Xuebin Chi , Supercomputing Center, Computer Network Information Center, Chinese Academy of Sciences, ZhongGuanCun, Beijing, China
pp. 1-10

Scalable implementations of accurate excited-state coupled cluster theories: application of high-level methods to porphyrin-based systems (Abstract)

Ryan M. Olson , Cray, Incorporated, MN
Vinod Tipparaju , Oak Ridge National Laboratory, Oak Ridge, TN
E. Aprà , Oak Ridge National Laboratory, Oak Ridge, TN
Sriram Krishnamoorthy , Pacific Northwest National Laboratory, Richland, WA
Karol Kowalski , Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, WA
pp. 1-10

Hardware/software co-design for energy-efficient seismic modeling (Abstract)

Samuel Williams , Lawrence Berkeley National Laboratory, Berkeley, CA
Jens Krueger , Lawrence Berkeley National Laboratory, Berkeley, CA and Fraunhofer ITWM, Kaiserslautern, Germany
David Donofrio , Lawrence Berkeley National Laboratory, Berkeley, CA
Franz-Josef Pfreund , Fraunhofer ITWM, Kaiserslautern, Germany
John Shalf , Lawrence Berkeley National Laboratory, Berkeley, CA
Marghoob Mohiyuddin , Lawrence Berkeley National Laboratory, Berkeley, CA and University of California at Berkeley, Berkeley, CA
Leonid Oliker , Lawrence Berkeley National Laboratory, Berkeley, CA
pp. 1-12

Optimizing the Barnes-Hut algorithm in UPC (Abstract)

Junchao Zhang , University of Illinois at Urbana-Champaign
Babak Behzad , University of Illinois at Urbana-Champaign
Marc Snir , University of Illinois at Urbana-Champaign
pp. 1-11

Avoiding hot-spots on two-level direct networks (Abstract)

Laxmikant V. Kale , University of Illinois at Urbana-Champaign, Urbana, IL
Nikhil Jain , University of Illinois at Urbana-Champaign, Urbana, IL
William D. Gropp , University of Illinois at Urbana-Champaign, Urbana, IL
Abhinav Bhatele , Lawrence Livermore National Laboratory, Livermore, CA
pp. 1-11

Improving communication performance in dense linear algebra via topology aware collectives (Abstract)

Abhinav Bhatele , Center for Applied Scientific Computing, Lawrence Livermore National Laboratory, Livermore, CA
James Demmel , University of California at Berkeley, Berkeley, CA
Edgar Solomonik , University of California at Berkeley, Berkeley, CA
pp. 1-11

Multithreaded Global Address Space Communication Techniques for Gyrokinetic Fusion Applications on Ultra-Scale Platforms (Abstract)

Stephane Ethier , Princeton Plasma Physics Laboratory, Princeton, NJ
Robert Preissl , Lawrence Berkeley National Laboratory, Berkeley, CA
Bill Long , Cray Inc., St. Paul, MN
John Shalf , Lawrence Berkeley National Laboratory, Berkeley, CA
Alice Koniges , Lawrence Berkeley National Laboratory, Berkeley, CA
Nathan Wichmann , Cray Inc., St. Paul, MN
pp. 1-11

Deep and wide metrics for HPC resource capability and project usage (Abstract)

David Hart , Nat. Center for Atmos. Res., Boulder, CO, USA
pp. 1-7

Challenges in the management of high-performance computing centers: An organizational perspective (Abstract)

Nicholas Berente , Terry Coll. of Bus., Univ. of Georgia, Athens, GA, USA
Jennifer Claggett , Terry Coll. of Bus., Univ. of Georgia, Athens, GA, USA
pp. 1-8

Integrating multi-touch in high-resolution display environments (Abstract)

Brandt Westing , Texas Adv. Comput. Center, Austin, TX, USA
Benjamin Urick , Texas Adv. Comput. Center, Austin, TX, USA
Maria Esteva , Texas Adv. Comput. Center, Austin, TX, USA
Freddy Rojas , Texas Adv. Comput. Center, Austin, TX, USA
Weijia Xu , Texas Adv. Comput. Center, Austin, TX, USA
pp. 1-9

How to measure useful, sustained performance (Abstract)

William Kramer , Nat. Center for Supercomput. Applic., Univ. of Illinois, Urbana, IL, USA
pp. 1-18

Performance modeling for systematic performance tuning (Abstract)

Torsten Hoefler , Nat. Center for Supercomput. Applic., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
William Gropp , Nat. Center for Supercomput. Applic., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
Marc Snir , Nat. Center for Supercomput. Applic., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
William Kramer , Nat. Center for Supercomput. Applic., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
pp. 1-12

A long-distance infiniband interconnection between two clusters in production use (Abstract)

Sabine Richling , IT-Center, Univ. of Heidelberg, Heidelberg, Germany
Steffen Hau , IT-Center, Univ. of Mannheim, Mannheim, Germany
Heinz Kredel , IT-Center, Univ. of Mannheim, Mannheim, Germany
Hans-Gunther Kruse , IT-Center, Univ. of Mannheim, Mannheim, Germany
pp. 1-8

SPOTlight on testing: Stability, performance and operational testing of LANL HPC clusters (Abstract)

Georgia Pedicini , High Performance Comput. Syst., Los Alamos Nat. Lab., Los Alamos, NM, USA
Jennifer Green , High Performance Comput. Syst., Los Alamos Nat. Lab., Los Alamos, NM, USA
pp. 1-8

A Toolkit for Event Analysis and Logging (Abstract)

James Carey , IBM, Rochester, MN, USA
Philip Sanders , IBM, Rochester, MN, USA
pp. 1-7

Best practices for the deployment and management of production HPC clusters (Abstract)

Robert McLay , Texas Adv. Comput. Center (TACC), Univ. of Texas at Austin, Austin, TX, USA
Karl W. Schulz , Texas Adv. Comput. Center (TACC), Univ. of Texas at Austin, Austin, TX, USA
William L. Barth , Texas Adv. Comput. Center (TACC), Univ. of Texas at Austin, Austin, TX, USA
Tommy Minyard , Texas Adv. Comput. Center (TACC), Univ. of Texas at Austin, Austin, TX, USA
pp. 1-11

Logjam: A scalable unified log file archiver (Abstract)

Nicholas P. Cardo , Lawrence Berkeley Nat. Lab., Nat. Energy Res. Sci. Comput. Center, Berkeley, CA, USA
pp. 1-9

The NWSC benchmark suite: Using scientific throughput to measure supercomputer performance (Abstract)

Rory C. Kelly , Comput. & Inf. Syst. Lab., Nat. Center for Atmos. Res., Boulder, CO, USA
Siddartha S. Ghosh , Comput. & Inf. Syst. Lab., Nat. Center for Atmos. Res., Boulder, CO, USA
Si Liu , Comput. & Inf. Syst. Lab., Nat. Center for Atmos. Res., Boulder, CO, USA
Davide Del Vento , Comput. & Inf. Syst. Lab., Nat. Center for Atmos. Res., Boulder, CO, USA
Richard A. Valent , Comput. & Inf. Syst. Lab., Nat. Center for Atmos. Res., Boulder, CO, USA
pp. 1-5

Qserv: A distributed shared-nothing database for the LSST catalog (Abstract)

Daniel L. Wang , SLAC Nat. Accel. Lab., Menlo Park, CA, USA
Serge M. Monkewitz , Infrared Process. & Anal. Center, California Inst. of Technol., Pasadena, CA, USA
Kian-Tat Lim , SLAC Nat. Accel. Lab., Menlo Park, CA, USA
Jacek Becla , SLAC Nat. Accel. Lab., Menlo Park, CA, USA
pp. 1-11

Challenges of HPC monitoring (Abstract)

William Allcock , Argonne Nat. Lab., Argonne, IL, USA
Evan Felix , Pacific Northwest Nat. Lab., Richland, WA, USA
Mike Lowe , Indiana Univ., Bloomington, IN, USA
Randal Rheinheimer , Los Alamos Nat. Lab., Los Alamos, NM, USA
Joshi Fullop , Nat. Center for Supercomput. Applic., USA
pp. 1-6

Adaptive simulation of turbulent flow past a full car model (Abstract)

Niclas Jansson , Comput. Technol. Lab., KTH R. Inst. of Tech., Stockholm, Sweden
Johan Hoffman , Comput. Technol. Lab., KTH R. Inst. of Tech., Stockholm, Sweden
Murtazo Nazarov , Comput. Technol. Lab., KTH R. Inst. of Tech., Stockholm, Sweden
pp. 1-8

World-highest resolution global atmospheric model and its performance on the Earth Simulator (Abstract)

Keiko Takahashi , Japan Agency for Marine-Earth Sci. & Technol., Yokosuka, Japan
Ken'ichi Itakura , Japan Agency for Marine-Earth Sci. & Technol., Yokosuka, Japan
Satom Okura , Japan Agency for Marine-Earth Sci. & Technol., Yokosuka, Japan
Kunihiko Watanabe , Japan Agency for Marine-Earth Sci. & Technol., Yokosuka, Japan
pp. 1-12

A survey of the practice of computational science (Abstract)

Prakash Prabhu , Princeton Univ., Princeton, NJ, USA
Thomas B. Jablin , Princeton Univ., Princeton, NJ, USA
Arun Raman , Princeton Univ., Princeton, NJ, USA
Yun Zhang , Princeton Univ., Princeton, NJ, USA
Jialu Huang , Princeton Univ., Princeton, NJ, USA
Hanjun Kim , Princeton Univ., Princeton, NJ, USA
Nick P. Johnson , Princeton Univ., Princeton, NJ, USA
Feng Liu , Princeton Univ., Princeton, NJ, USA
Soumyadeep Ghosh , Princeton Univ., Princeton, NJ, USA
Stephen Beard , Princeton Univ., Princeton, NJ, USA
Taewook Oh , Princeton Univ., Princeton, NJ, USA
Matthew Zoufaly , Princeton Univ., Princeton, NJ, USA
David Walker , Princeton Univ., Princeton, NJ, USA
David I. August , Princeton Univ., Princeton, NJ, USA
pp. 1-12

Performance evaluations of gyrokinetic Eulerian code GT5D on massively parallel multi-core platforms (Abstract)

Yasuhiro Idomura , Japan Atomic Energy Agency, Tokyo, Japan
Sebastien Jolliet , Japan Atomic Energy Agency, Tokyo, Japan
pp. 1-9

Janus: Co-designing HPC systems and facilities (Abstract)

Henry M. Tufo , Univ. of Colorado, Boulder, Boulder, CO, USA
Michael K. Patterson , Intel Corp., Hillsboro, OR, USA
Michael Oberg , Nat. Center for Atmos. Res., Boulder, CO, USA
Matthew Woitaszek , Nat. Center for Atmos. Res., Boulder, CO, USA
Guy Cobb , Univ. of Colorado, Boulder, Boulder, CO, USA
Robert Strong , Critical Facilities Technol., Arvada, CO, USA
Jim Gutowski , Dell, Inc., Round Rock, TX, USA
pp. 1-9