SC Conference (2008)
Austin, Texas
Nov. 15, 2008 to Nov. 21, 2008
ISBN: 978-1-4244-2835-9
TABLE OF CONTENTS
Front Matter

Front Matter (PDF)

pp. i-iii
Papers

Entering the petaflop era: the architecture and performance of Roadrunner (Abstract)

Mike Lang , Los Alamos National Laboratory, Los Alamos
Adolfy Hoisie , Los Alamos National Laboratory, Los Alamos
Kei Davis , Los Alamos National Laboratory, Los Alamos
Scott Pakin , Los Alamos National Laboratory, Los Alamos
Darren J. Kerbyson , Los Alamos National Laboratory, Los Alamos
Jose C. Sancho , Los Alamos National Laboratory, Los Alamos
Kevin J. Barker , Los Alamos National Laboratory, Los Alamos
pp. 1-11

High performance discrete Fourier transforms on graphics processors (Abstract)

John Manferdelli , Microsoft Corporation
Naga K. Govindaraju , Microsoft Corporation
Yuri Dotsenko , Microsoft Corporation
Brandon Lloyd , Microsoft Corporation
Burton Smith , Microsoft Corporation
pp. 1-12

Dynamically adapting file domain partitioning methods for collective I/O based on underlying parallel file system locking protocols (Abstract)

Wei-keng Liao , Northwestern University, Evanston, Illinois
Alok Choudhary , Northwestern University, Evanston, Illinois
pp. 1-12

Stencil computation optimization and auto-tuning on state-of-the-art multicore architectures (Abstract)

Leonid Oliker , Lawrence Berkeley National Laboratory, Berkeley, CA and University of California at Berkeley, Berkeley, CA
David Patterson , Lawrence Berkeley National Laboratory, Berkeley, CA and University of California at Berkeley, Berkeley, CA
Kaushik Datta , Lawrence Berkeley National Laboratory, Berkeley, CA and University of California at Berkeley, Berkeley, CA
John Shalf , Lawrence Berkeley National Laboratory, Berkeley, CA
Samuel Williams , Lawrence Berkeley National Laboratory, Berkeley, CA and University of California at Berkeley, Berkeley, CA
Katherine Yelick , Lawrence Berkeley National Laboratory, Berkeley, CA and University of California at Berkeley, Berkeley, CA
Mark Murphy , University of California at Berkeley, Berkeley, CA
Vasily Volkov , University of California at Berkeley, Berkeley, CA
Jonathan Carter , Lawrence Berkeley National Laboratory, Berkeley, CA
pp. 1-12

Bandwidth intensive 3-D FFT kernel for GPUs using CUDA (Abstract)

Akira Nukada , Tokyo Institute of Technology, Tokyo, Japan and Japan Science and Technology Agency, Kawaguchi, Saitama, Japan
Yasuhiko Ogata , Tokyo Institute of Technology, Tokyo, Japan and Japan Science and Technology Agency, Kawaguchi, Saitama, Japan
Toshio Endo , Tokyo Institute of Technology, Tokyo, Japan and Japan Science and Technology Agency, Kawaguchi, Saitama, Japan
Satoshi Matsuoka , Tokyo Institute of Technology, Tokyo, Japan and National Institute of Informatics, Tokyo, Japan and Japan Science and Technology Agency, Kawaguchi, Saitama, Japan
pp. 1-11

Using server-to-server communication in parallel file systems to simplify consistency and improve performance (Abstract)

Walter B. Ligon , Clemson University, Clemson, SC
Bradley W. Settlemyer , Clemson University, Clemson, SC
Philip H. Carns , Argonne National Laboratory, Argonne, IL
pp. 1-8

Scientific application-based performance comparison of SGI Altix 4700, IBM POWER5+, and SGI ICE 8200 supercomputers (Abstract)

Haoqiang Jin , NASA Ames Research Center, California
Dennis Jespersen , NASA Ames Research Center, California
Dale Talcott , NASA Ames Research Center, California
Subhash Saini , NASA Ames Research Center, California
Rupak Biswas , NASA Ames Research Center, California
Jahed Djomehri , NASA Ames Research Center, California
pp. 1-12

Adapting a message-driven parallel application to GPU-accelerated clusters (Abstract)

Klaus Schulten , University of Illinois at Urbana-Champaign, Urbana, IL
James C. Phillips , University of Illinois at Urbana-Champaign, Urbana, IL
John E. Stone , University of Illinois at Urbana-Champaign, Urbana, IL
pp. 1-9

Scaling parallel I/O performance through I/O delegate and caching system (Abstract)

Wei-keng Liao , Northwestern University, Evanston, Illinois
Arifa Nisar , Northwestern University, Evanston, Illinois
Alok Choudhary , Northwestern University, Evanston, Illinois
pp. 1-12

Efficient management of data center resources for massively multiplayer online games (Abstract)

Thomas Fahringer , University of Innsbruck, Innsbruck, Austria
Radu Prodan , University of Innsbruck, Innsbruck, Austria
Alexandru Iosup , Delft University of Technology, Delft, The Netherlands
Vlad Nae , University of Innsbruck, Innsbruck, Austria
Stefan Podlipnig , University of Innsbruck, Innsbruck, Austria
Dick Epema , Delft University of Technology, Delft, The Netherlands
pp. 1-12

Performance optimization of TCP/IP over 10 gigabit ethernet by precise instrumentation (Abstract)

Yutaka Sugawara , The University of Tokyo
Takeshi Yoshino , Google Japan Inc.
Mary Inaba , The University of Tokyo
Junji Tamatsukuri , The University of Tokyo
Kei Hiraki , The University of Tokyo
Katsushi Inagami , The University of Tokyo
pp. 1-12

A multi-level parallel simulation approach to electron transport in nano-scale transistors (Abstract)

Mathieu Luisier , Purdue University, West Lafayette, IN
Gerhard Klimeck , Purdue University, West Lafayette, IN
pp. 1-10

Feedback-controlled resource sharing for predictable eScience (Abstract)

Sang-Min Park , University of Virginia, Charlottesville, VA
Marty Humphrey , University of Virginia, Charlottesville, VA
pp. 1-11

Wide-area performance profiling of 10GigE and InfiniBand technologies (Abstract)

William R. Wing , Oak Ridge National Laboratory, Oak Ridge, TN
Stephen W. Poole , Oak Ridge National Laboratory, Oak Ridge, TN
Nageswara S. V. Rao , Oak Ridge National Laboratory, Oak Ridge, TN
Weikuan Yu , Oak Ridge National Laboratory, Oak Ridge, TN
Jeffrey S. Vetter , Oak Ridge National Laboratory, Oak Ridge, TN
pp. 1-12

Accelerating configuration interaction calculations for nuclear structure (Abstract)

Philip Sternberg , Lawrence Berkeley National Laboratory, Berkeley, CA
James P. Vary , Iowa State University, Ames, IA
Masha Sosonkina , Iowa State University, Ames, IA
Pieter Maris , Iowa State University, Ames, IA
Esmond G. Ng , Lawrence Berkeley National Laboratory, Berkeley, CA
Chao Yang , Lawrence Berkeley National Laboratory, Berkeley, CA
Hung Viet Le , Iowa State University, Ames, IA
pp. 1-12

Efficient auction-based grid reservations using dynamic programming (Abstract)

Rich Wolski , University of California Santa Barbara, Santa Barbara, CA
Andrew Mutz , University of California Santa Barbara, Santa Barbara, CA
pp. 1-8

Asymmetric interactions in symmetric multi-core systems: analysis, enhancements and evaluation (Abstract)

T. Scogland , Virginia Tech
P. Balaji , Argonne National Lab
G. Narayanaswamy , Virginia Tech
W. Feng , Virginia Tech
pp. 1-12

Dendro: parallel algorithms for multigrid and AMR methods on 2:1 balanced octrees (Abstract)

Rahul S. Sampath , Georgia Institute of Technology, Atlanta, GA
George Biros , Georgia Institute of Technology, Atlanta, GA
Hari Sundar , University of Pennsylvania, Philadelphia, PA
Ilya Lashuk , Georgia Institute of Technology, Atlanta, GA
Santi S. Adavani , University of Pennsylvania, Philadelphia, PA
pp. 1-12

Characterizing application sensitivity to OS interference using kernel-level noise injection (Abstract)

Ron Brightwell , Sandia National Laboratories, Albuquerque, NM
Patrick Bridges , The University of New Mexico, Albuquerque, NM
Kurt B. Ferreira , The University of New Mexico, Albuquerque, NM
pp. 1-12

Performance prediction of large-scale parallel system and application using macro-level simulation (Abstract)

Hidemi Komatsu , Fujitsu, Tokyo, Japan
Hisashige Ando , Fujitsu, Tokyo, Japan
Hiroaki Honda , Information Technologies & Nanotechnologies, Fukuoka, Japan
Yuichi Inadomi , Information Technologies & Nanotechnologies, Fukuoka, Japan
Mutsumi Aoyagi , Kyushu University, Fukuoka, Japan
Yunqing Yu , Kyushu University, Fukuoka, Japan
Ryutaro Susukita , Information Technologies & Nanotechnologies, Fukuoka, Japan
Hidetomo Shibamura , Information Technologies & Nanotechnologies, Fukuoka, Japan
Koji Inoue , Kyushu University, Fukuoka, Japan
Shigeru Ishizuki , Fujitsu, Tokyo, Japan
Shuji Yamamura , Fujitsu, Tokyo, Japan
Motoyoshi Kurokawa , RIKEN (The Institute of Physical & Chemical Research), Wako, Japan
Kazuaki J. Murakami , Kyushu University, Fukuoka, Japan
Yasunori Kimura , Fujitsu, Tokyo, Japan
pp. 1-9

A novel domain oriented approach for scientific grid workflow composition (Abstract)

Jun Qin , University of Innsbruck, Innsbruck, Austria
Thomas Fahringer , University of Innsbruck, Innsbruck, Austria
pp. 1-12

Toward loosely coupled programming on petascale systems (Abstract)

Ben Clifford , University of Chicago and Argonne National Laboratory, Chicago, IL
Ioan Raicu , University of Chicago, Chicago, IL
Mike Wilde , Argonne National Laboratory, Argonne, IL and University of Chicago and Argonne National Laboratory, Chicago, IL
Kamil Iskra , Argonne National Laboratory, Argonne, IL
Pete Beckman , Argonne National Laboratory, Argonne, IL
Ian Foster , Argonne National Laboratory, Argonne, IL and University of Chicago, Chicago, IL and University of Chicago and Argonne National Laboratory, Chicago, IL
Zhao Zhang , University of Chicago and Argonne National Laboratory, Chicago, IL
pp. 1-12

Early evaluation of IBM BlueGene/P (Abstract)

P. Worley , Oak Ridge National Laboratory, Oak Ridge, TN
M. Bast , Oak Ridge National Laboratory, Oak Ridge, TN
W. Yu , Oak Ridge National Laboratory, Oak Ridge, TN
P. Roth , Oak Ridge National Laboratory, Oak Ridge, TN
S. Alam , Oak Ridge National Laboratory, Oak Ridge, TN
M. R. Fahey , Oak Ridge National Laboratory, Oak Ridge, TN
J. S. Vetter , Oak Ridge National Laboratory, Oak Ridge, TN
J. Rogers , Oak Ridge National Laboratory, Oak Ridge, TN
J. Kuehn , Oak Ridge National Laboratory, Oak Ridge, TN
C. McCurdy , Oak Ridge National Laboratory, Oak Ridge, TN
R. Barrett , Oak Ridge National Laboratory, Oak Ridge, TN
R. Sankaran , Oak Ridge National Laboratory, Oak Ridge, TN
pp. 1-12

Nimrod/K: towards massively parallel dynamic grid workflows (Abstract)

Colin Enticott , Monash University, Victoria, Australia
David Abramson , Monash University, Victoria, Australia
Ilkay Altinas , San Diego Supercomputer Center, La Jolla, CA
pp. 1-11

SMARTMAP: operating system support for efficient data sharing among processes on a multi-core processor (Abstract)

Ron Brightwell , Sandia National Laboratories, Albuquerque, New Mexico
Kevin Pedretti , Sandia National Laboratories, Albuquerque, New Mexico
Trammell Hudson , Operating Systems Research, Washington, DC
pp. 1-12

Lessons learned at 208K: towards debugging millions of cores (Abstract)

Bronis R. de Supinski , Lawrence Livermore National Laboratory, Livermore, CA
Dong H. Ahn , Lawrence Livermore National Laboratory, Livermore, CA
Martin Schulz , Lawrence Livermore National Laboratory, Livermore, CA
Barton P. Miller , University of Wisconsin, Madison, WI
Matthew Legendre , University of Wisconsin, Madison, WI
Gregory L. Lee , Lawrence Livermore National Laboratory, Livermore, CA
Dorian C. Arnold , University of Wisconsin, Madison, WI
Ben Liblit , University of Wisconsin, Madison, WI
pp. 1-9

Applying double auctions for scheduling of workflows on the Grid (Abstract)

Marek Wieczorek , University of Innsbruck, Innsbruck, Austria
Radu Prodan , University of Innsbruck, Innsbruck, Austria
Stefan Podlipnig , University of Innsbruck, Innsbruck, Austria
Thomas Fahringer , University of Innsbruck, Innsbruck, Austria
pp. 1-11

A novel migration-based NUCA design for chip multiprocessors (Abstract)

Feihui Li , NVIDIA
Mahmut Kandemir , Pennsylvania State University
Mary Jane Irwin , Pennsylvania State University
Seung Woo Son , Pennsylvania State University
pp. 1-12

Communication avoiding Gaussian elimination (Abstract)

James W. Demmel , UC Berkeley, CA
Laura Grigori , Universite Paris-Sud, Orsay, France
Hua Xiang , Universite Paris-Sud, Orsay, France
pp. 1-12

Extending CC-NUMA systems to support write update optimizations (Abstract)

Liqun Cheng , Intel Corp. and University of Utah
John B. Carter , IBM Austin Research Laboratory and University of Utah
pp. 1-12

Benchmarking GPUs to tune dense linear algebra (Abstract)

James W. Demmel , University of California at Berkeley
Vasily Volkov , University of California at Berkeley
pp. 1-11

High-radix crossbar switches enabled by proximity communication (Abstract)

Wladek Olesinski , Sun Microsystems, Menlo Park, CA
Hans Eberle , Sun Microsystems, Menlo Park, CA
José Flich , Universidad Politécnica de Valencia, Valencia, Spain
José Duato , Universidad Politécnica de Valencia, Valencia, Spain
Nils Gura , Sun Microsystems, Menlo Park, CA
Pedro J. Garcia , Universidad de Castilla-La Mancha, Albacete, Spain
Robert Drost , Sun Microsystems, Menlo Park, CA
David Hopkins , Sun Microsystems, Menlo Park, CA
pp. 1-12

Massively parallel genomic sequence search on the Blue Gene/P architecture (Abstract)

Carlos Sosa , University of Minnesota, Minneapolis, MN
Heshan Lin , North Carolina State University
Pavan Balaji , Argonne National Laboratory
Xiaosong Ma , North Carolina State University
Wu-chun Feng , Virginia Tech
pp. 1-11

The role of MPI in development time: a case study (Abstract)

Lynn B. Reid , University of Chicago
Lorin Hochstein , USC Information Sciences Institute
Forrest Shull , Fraunhofer Center Maryland
pp. 1-10

An efficient parallel approach for identifying protein families in large-scale metagenomic data sets (Abstract)

Changjun Wu , Washington State University, Pullman, WA
Ananth Kalyanaraman , Washington State University, Pullman, WA
pp. 1-10

An adaptive cut-off for task parallelism (Abstract)

Eduard Ayguadé , Universitat Politècnica de Catalunya
Julita Corbalán , Universitat Politècnica de Catalunya
Alejandro Duran , Universitat Politècnica de Catalunya
pp. 1-11

EpiSimdemics: an efficient algorithm for simulating the spread of infectious disease over large realistic social networks (Abstract)

Xizhou Feng , Virginia Tech, Blacksburg, VA
Stephen G. Eubank , Virginia Tech, Blacksburg, VA
Christopher L. Barrett , Virginia Tech, Blacksburg, VA
Madhav V. Marathe , Virginia Tech, Blacksburg, VA
Keith R. Bisset , Virginia Tech, Blacksburg, VA
pp. 1-12

Programming the Intel 80-core network-on-a-chip terascale processor (Abstract)

Timothy G. Mattson , Intel Corp., DuPont, WA
Rob Van der Wijngaart , Intel Corp., Santa Clara, CA
Michael Frumkin , Google Inc., Mountain View, CA
pp. 1-11

PAM: a novel performance/power aware meta-scheduler for multi-core systems (Abstract)

Dan Poff , IBM Thomas J. Watson Research Center, Hawthorne, NY
Bulent Abali , IBM Thomas J. Watson Research Center, Hawthorne, NY
Mohammad Banikazemi , IBM Thomas J. Watson Research Center, Hawthorne, NY
pp. 1-12

Hiding I/O latency with pre-execution prefetching for parallel applications (Abstract)

Xian-He Sun , Illinois Institute of Technology, Chicago, IL
Surendra Byna , Illinois Institute of Technology, Chicago, IL
Yong Chen , Illinois Institute of Technology, Chicago, IL
Rajeev Thakur , Argonne National Laboratory, Argonne, IL
William Gropp , University of Illinois Urbana-Champaign, Urbana, IL
pp. 1-10

A dynamic scheduler for balancing HPC applications (Abstract)

Carlos Boneti , Universitat Politècnica de Catalunya, Spain
Francisco J. Cazorla , Barcelona Supercomputing Center, Spain
Roberto Gioiosa , Barcelona Supercomputing Center, Spain
Mateo Valero , Barcelona Supercomputing Center, Spain and Universitat Politècnica de Catalunya, Spain
pp. 1-12

Characterizing and predicting the I/O performance of HPC applications using a parameterized synthetic benchmark (Abstract)

Hongzhang Shan , Lawrence Berkeley National Laboratory, Berkeley, CA
John Shalf , Lawrence Berkeley National Laboratory, Berkeley, CA
Katie Antypas , Lawrence Berkeley National Laboratory, Berkeley, CA
pp. 1-12

Proactive process-level live migration in HPC environments (Abstract)

Chao Wang , North Carolina State University, Raleigh, NC
Stephen L. Scott , Oak Ridge National Laboratory, Oak Ridge, TN
Christian Engelmann , Oak Ridge National Laboratory, Oak Ridge, TN
Frank Mueller , North Carolina State University, Raleigh, NC
pp. 1-12

Parallel I/O prefetching using MPI file caching and I/O signatures (Abstract)

Xian-He Sun , Illinois Institute of Technology, Chicago, IL
Yong Chen , Illinois Institute of Technology, Chicago, IL
William Gropp , University of Illinois Urbana-Champaign, Urbana, IL
Rajeev Thakur , Argonne National Laboratory, Argonne, IL
Surendra Byna , Illinois Institute of Technology, Chicago, IL
pp. 1-12

BitDew: a programmable environment for large-scale data management and distribution (Abstract)

Gilles Fedak , Univ Paris-Sud, CNRS, Orsay
Franck Cappello , Univ Paris-Sud, CNRS, Orsay
Haiwu He , Univ Paris-Sud, CNRS, Orsay
pp. 1-12

Scalable load-balance measurement for SPMD codes (Abstract)

Martin Schulz , Lawrence Livermore National Laboratory
Todd Gamblin , University of North Carolina at Chapel Hill
Daniel A. Reed , Microsoft Research
Bronis R. de Supinski , Lawrence Livermore National Laboratory
Rob Fowler , University of North Carolina at Chapel Hill
pp. 1-12

Using overlays for efficient data transfer over shared wide-area networks (Abstract)

Tahsin Kurc , The Ohio State University
Umit Catalyurek , The Ohio State University
Rajkumar Kettimuthu , Argonne National Laboratory
Ian Foster , Argonne National Laboratory
Joel Saltz , The Ohio State University
P. Sadayappan , The Ohio State University
Gaurav Khanna , The Ohio State University
pp. 1-12

Massively parallel volume rendering using 2-3 swap image compositing (Abstract)

Chaoli Wang , University of California at Davis
Kwan-Liu Ma , University of California at Davis
Hongfeng Yu , University of California at Davis
pp. 1-11

Capturing performance knowledge for automated analysis (Abstract)

Oscar Hernandez , University of Houston, Houston, TX
Sunita Chandrasekaran , Nanyang Technological University, Singapore
Barbara Chapman , University of Houston, Houston, TX
Boyana Norris , Argonne National Laboratory, Argonne, IL
Kevin A. Huck , University of Oregon, Eugene, OR
Van Bui , University of Houston, Houston, TX
Allen D. Malony , University of Oregon, Eugene, OR
Lois Curfman McInnes , Argonne National Laboratory, Argonne, IL
pp. 1-10

The cost of doing science on the cloud: the Montage example (Abstract)

Ewa Deelman , USC Information Sciences Institute, Marina del Rey, CA
Gurmeet Singh , USC Information Sciences Institute, Marina del Rey, CA
Bruce Berriman , California Institute of Technology, Pasadena, CA
John Good , California Institute of Technology, Pasadena, CA
Miron Livny , University of Wisconsin Madison, Madison, WI
pp. 1-12

High performance multivariate visual data exploration for extremely large data (Abstract)

Peter Messmer , Tech-X Corporation, Boulder, CO
Bernd Hamann , Lawrence Berkeley National Laboratory, Berkeley, CA and University of California, Davis, CA and Technische Universität Kaiserslautern, Kaiserslautern, Germany
E. Wes Bethel , Lawrence Berkeley National Laboratory, Berkeley, CA and University of California, Davis, CA
Gunther H. Weber , Lawrence Berkeley National Laboratory, Berkeley, CA
Hans Hagen , Technische Universität Kaiserslautern, Kaiserslautern, Germany
Prabhat , Lawrence Berkeley National Laboratory, Berkeley, CA
Hank Childs , Lawrence Livermore National Laboratory, Livermore, CA
Oliver Rübel , Lawrence Berkeley National Laboratory, Berkeley, CA and University of California, Davis, CA and Technische Universität Kaiserslautern, Kaiserslautern, Germany
Sean Ahern , Oak Ridge National Laboratory, Oak Ridge, TN
Estelle Cormier-Michel , LOASIS program of Lawrence Berkeley National Laboratory, Berkeley, CA
Kesheng Wu , Lawrence Berkeley National Laboratory, Berkeley, CA
Cameron G. R. Geddes , LOASIS program of Lawrence Berkeley National Laboratory, Berkeley, CA
Jeremy Meredith , Oak Ridge National Laboratory, Oak Ridge, TN
pp. 1-12

Server-storage virtualization: integration and load balancing in data centers (Abstract)

Dushmanta Mohapatra , Georgia Tech
Madhukar Korupolu , IBM Almaden Research Center
Aameek Singh , IBM Almaden Research Center
pp. 1-12

Materialized community ground models for large-scale earthquake simulation (Abstract)

Ricardo Taborda , Carnegie Mellon University
Julio López , Carnegie Mellon University
Michael P. Ryan , Intel Research Pittsburgh
Jacobo Bielak , Carnegie Mellon University
David R. O'Hallaron , Intel Research Pittsburgh and Carnegie Mellon University
Steven W. Schlosser , Intel Research Pittsburgh
pp. 1-12

Positivity, posynomials and tile size selection (Abstract)

Sanjay Rajopadhye , Colorado State University, Fort Collins, Colorado
Lakshminarayanan Renganarayana , IBM T.J. Watson Research Center, Yorktown Heights, New York
pp. 1-12

A scalable parallel framework for analyzing terascale molecular dynamics simulation trajectories (Abstract)

Patrick Miller , D. E. Shaw Research, New York, NY
Charles A. Rendleman , D. E. Shaw Research, New York, NY
Morten Ø. Jensen , D. E. Shaw Research, New York, NY
David W. Borhani , D. E. Shaw Research, New York, NY
David E. Shaw , D. E. Shaw Research, New York, NY
Tiankai Tu , D. E. Shaw Research, New York, NY
Paul Maragakis , D. E. Shaw Research, New York, NY
Justin Gullingsrud , D. E. Shaw Research, New York, NY
John L. Klepeis , D. E. Shaw Research, New York, NY
Ron O. Dror , D. E. Shaw Research, New York, NY
Kate A. Stafford , D. E. Shaw Research, New York, NY
pp. 1-12

Global trees: a framework for linked data structures on distributed memory parallel systems (Abstract)

Sriram Krishnamoorthy , Pacific Northwest National Laboratory, Richland, WA
Atanas Rountev , The Ohio State University, Columbus, OH
D. Brian Larkins , The Ohio State University, Columbus, OH
James Dinan , The Ohio State University, Columbus, OH
Srinivasan Parthasarathy , The Ohio State University, Columbus, OH
P. Sadayappan , The Ohio State University, Columbus, OH
pp. 1-13

Parallel exact inference on the cell broadband engine processor (Abstract)

Viktor K. Prasanna , University of Southern California, Los Angeles, CA
Yinglong Xia , University of Southern California, Los Angeles, CA
pp. 1-12

Prefetch throttling and data pinning for improving performance of shared caches (Abstract)

Mustafa Karakoy , Imperial College
Seung Woo Son , Pennsylvania State University
Mahmut Kandemir , Pennsylvania State University
Ozcan Ozturk , Bilkent University
pp. 1-12

High-frequency simulations of global seismic wave propagation using SPECFEM3D_GLOBE on 62K processors (Abstract)

Allan Snavely , San Diego Supercomputer Center, La Jolla, CA
Nicolas Le Goff , Université de Pau, Pau, France
Michael Laurenzano , San Diego Supercomputer Center, La Jolla, CA
Jeroen Tromp , California Institute of Technology, Pasadena, CA
Laura Carrington , San Diego Supercomputer Center, La Jolla, CA
David Michéa , Université de Pau, Pau, France
Mustafa M Tikir , San Diego Supercomputer Center, La Jolla, CA
Dimitri Komatitsch , Université de Pau, Pau, France and Institut Universitaire de France, Paris, France
pp. 1-11

New algorithm to enable 400+ TFlop/s sustained performance in simulations of disorder effects in high-Tc superconductors (Abstract)

E. F. D'Azevedo , Oak Ridge National Laboratory, Oak Ridge, TN
D. E. Maxwell , Oak Ridge National Laboratory, Oak Ridge, TN
P. R. C. Kent , Oak Ridge National Laboratory, Oak Ridge, TN
J. Levesque , Cray Incorporated, Oak Ridge, TN
M. S. Summers , Oak Ridge National Laboratory, Oak Ridge, TN
J. M. Larkin , Cray Incorporated, Oak Ridge, TN
G. Alvarez , Oak Ridge National Laboratory, Oak Ridge, TN
T. A. Maier , Oak Ridge National Laboratory, Oak Ridge, TN
T. C. Schulthess , Oak Ridge National Laboratory, Oak Ridge, TN
J. S. Meredith , Oak Ridge National Laboratory, Oak Ridge, TN
M. Eisenbach , Oak Ridge National Laboratory, Oak Ridge, TN
pp. 1-10

Scalable adaptive mantle convection simulation on petascale supercomputers (Abstract)

Shijie Zhong , University of Colorado, Boulder, Colorado
Eh Tan , California Institute of Technology, Pasadena, California
Carsten Burstedde , The University of Texas at Austin, Austin, Texas
Lucas C. Wilcox , The University of Texas at Austin, Austin, Texas
Omar Ghattas , The University of Texas at Austin, Austin, Texas
Michael Gurnis , California Institute of Technology, Pasadena, California
Tiankai Tu , The University of Texas at Austin, Austin, Texas
Georg Stadler , The University of Texas at Austin, Austin, Texas
pp. 1-15

0.374 Pflop/s trillion-particle kinetic modeling of laser plasma interaction on Roadrunner (Abstract)

B. J. Albright , X-1-PTA Plasma Theory and Applications
K. J. Bowers , X-1-PTA Plasma Theory and Applications
D. J. Kerbyson , Computing of the Los Alamos National Laboratory, Los Alamos, NM
K. J. Barker , Computing of the Los Alamos National Laboratory, Los Alamos, NM
L. Yin , X-1-PTA Plasma Theory and Applications
B. Bergen , CCS-2 Computational Physics
pp. 1-11

369 Tflop/s molecular dynamics simulations on the Roadrunner general-purpose heterogeneous supercomputer (Abstract)

Timothy C. Germann , Los Alamos National Laboratory, Los Alamos, NM
Kai Kadau , Los Alamos National Laboratory, Los Alamos, NM
Gordon C. Fossum , IBM Corporation, Austin, TX
Sriram Swaminarayan , Los Alamos National Laboratory, Los Alamos, NM
pp. 1-10

Linearly scaling 3D fragment method for large-scale electronic structure calculations (Abstract)

Byounghak Lee , Lawrence Berkeley National Laboratory, Berkeley, CA
Erich Strohmaier , Lawrence Berkeley National Laboratory, Berkeley, CA
Zhengji Zhao , Lawrence Berkeley National Laboratory, Berkeley, CA
Lin-Wang Wang , Lawrence Berkeley National Laboratory, Berkeley, CA
Hongzhang Shan , Lawrence Berkeley National Laboratory, Berkeley, CA
David H. Bailey , Lawrence Berkeley National Laboratory, Berkeley, CA
Juan Meza , Lawrence Berkeley National Laboratory, Berkeley, CA
pp. 1-10