SC Conference (2007)
Reno, Nevada
Nov. 10, 2007 to Nov. 16, 2007
ISBN: 978-1-59593-764-3
TABLE OF CONTENTS
Front Matter

Front Matter (PDF)

pp. i-xix
Papers

Programming bits and atoms (Abstract)

Neil Gershenfeld , Massachusetts Institute of Technology
pp. 1

A preliminary investigation of a neocortex model implementation on the Cray XD1 (Abstract)

Kenneth L. Rice , Clemson University, Clemson, SC
Christopher N. Vutsinas , Clemson University, Clemson, SC
Tarek M. Taha , Clemson University, Clemson, SC
pp. 1-8

Anatomy of a cortical simulator (Abstract)

Rajagopal Ananthanarayanan , IBM Almaden Research Center, San Jose, CA
Dharmendra S. Modha , IBM Almaden Research Center, San Jose, CA
pp. 1-12

Large-scale maximum likelihood-based phylogenetic analysis on the IBM BlueGene/L (Abstract)

Alexandros Stamatakis , School of Computer and Communication Sciences
Michael Ott , Technical University of Munich
Jaroslaw Zola , Iowa State University
Srinivas Aluru , Iowa State University
pp. 1-11

Age-based packet arbitration in large-radix k-ary n-cubes (Abstract)

Dennis Abts , Cray Inc., Chippewa Falls, Wisconsin
Deborah Weisser , Google Inc., Mountain View, California
pp. 1-11

Evaluating network information models on resource efficiency and application performance in lambda-grids (Abstract)

Nut Taesombut , University of California, La Jolla, CA
Andrew A. Chien , University of California, La Jolla, CA
pp. 1-12

Virtual machine aware communication libraries for high performance computing (Abstract)

Qi Gao , The Ohio State University, Columbus, OH
Matthew J. Koop , The Ohio State University, Columbus, OH
Wei Huang , The Ohio State University, Columbus, OH
Dhabaleswar K. Panda , The Ohio State University, Columbus, OH
pp. 1-12

Investigation of leading HPC I/O performance using a scientific-application derived benchmark (Abstract)

Hongzhang Shan , Lawrence Berkeley National Laboratory, Berkeley, CA
Julian Borrill , Lawrence Berkeley National Laboratory, Berkeley, CA
John Shalf , Lawrence Berkeley National Laboratory, Berkeley, CA
Leonid Oliker , Lawrence Berkeley National Laboratory, Berkeley, CA
pp. 1-12

Automatic resource specification generation for resource selection (Abstract)

Henri Casanova , University of Hawai'i at Manoa
Richard Huang , University of California, San Diego
Andrew A. Chien , University of California, San Diego
pp. 1-11

Performance and cost optimization for multiple large-scale grid workflow applications (Abstract)

Rubing Duan , University of Innsbruck
Thomas Fahringer , University of Innsbruck
Radu Prodan , University of Innsbruck
pp. 1-12

Inter-operating grids through delegated matchmaking (Abstract)

Todd Tannenbaum , University of Wisconsin, Madison, WI, US
Alexandru Iosup , Delft University of Technology, Delft, NL
Miron Livny , University of Wisconsin, Madison, WI, US
Matthew Farrellee , University of Wisconsin, Madison, WI, US
Dick H. J. Epema , Delft University of Technology, Delft, NL
pp. 1-12

Automatic software interference detection in parallel applications (Abstract)

Vahid Tabatabaee , University of Maryland at College Park
Jeffrey K. Hollingsworth , University of Maryland at College Park
pp. 1-12

DMTracker: finding bugs in large-scale parallel programs by detecting anomaly in data movements (Abstract)

Qi Gao , The Ohio State University, Columbus, OH
Dhabaleswar K. Panda , The Ohio State University, Columbus, OH
Feng Qin , The Ohio State University, Columbus, OH
pp. 1-12

Scalable security for petascale parallel file systems (Abstract)

Andrew W. Leung , University of California, Santa Cruz, CA
Ethan L. Miller , University of California, Santa Cruz, CA
Stephanie Jones , University of California, Santa Cruz, CA
pp. 1-12

The Cray BlackWidow: a highly scalable vector multiprocessor (Abstract)

Gerald Schwoerer , Cray Inc., Chippewa Falls, Wisconsin
Greg Faanes , Cray Inc., Chippewa Falls, Wisconsin
Jim Schwarzmeier , Cray Inc., Chippewa Falls, Wisconsin
Abdulla Bataineh , Cray Inc., Chippewa Falls, Wisconsin
Eric Lundberg , Cray Inc., Chippewa Falls, Wisconsin
Tim Johnson , Cray Inc., Chippewa Falls, Wisconsin
Mike Bye , Cray Inc., Chippewa Falls, Wisconsin
Steve Scott , Cray Inc., Chippewa Falls, Wisconsin
Dennis Abts , Cray Inc., Chippewa Falls, Wisconsin
pp. 1-12

GRAPE-DR: 2-Pflops massively-parallel computer with 512-core, 512-Gflops processor chips for scientific computing (Abstract)

Mary Inaba , The University of Tokyo, Tokyo, Japan
Junichiro Makino , National Astronomical Observatory of Japan, Tokyo, Japan
Kei Hiraki , The University of Tokyo, Tokyo, Japan
pp. 1-11

A case for low-complexity MP architectures (Abstract)

Erik Hagersten , Uppsala University, Uppsala, Sweden
Håkan Zeffer , Uppsala University, Uppsala, Sweden
pp. 1-12

Variable latency caches for nanoscale processor (Abstract)

Ja Chun Ku , Northwestern University, Evanston, IL
Serkan Ozdemir , Northwestern University, Evanston, IL
Yehea Ismail , Northwestern University, Evanston, IL
Arindam Mallik , Northwestern University, Evanston, IL
Gokhan Memik , Northwestern University, Evanston, IL
pp. 1-10

Data access history cache and associated data prefetching mechanisms (Abstract)

Yong Chen , Illinois Institute of Technology, Chicago, IL
Surendra Byna , Illinois Institute of Technology, Chicago, IL
Xian-He Sun , Illinois Institute of Technology, Chicago, IL and Fermi National Accelerator Laboratory, Batavia, IL
pp. 1-12

Scaling performance of interior-point method on large-scale chip multiprocessor system (Abstract)

Anthony D. Nguyen , Microprocessor Technology Labs, Intel
Mikhail Smelyanskiy , Microprocessor Technology Labs, Intel
Daehyun Kim , Microprocessor Technology Labs, Intel
Victor W. Lee , Microprocessor Technology Labs, Intel
Pradeep Dubey , Microprocessor Technology Labs, Intel
pp. 1-11

Data exploration of turbulence simulations using a database cluster (Abstract)

Randal Burns , Johns Hopkins University, Baltimore, MD
Eric Perlman , Johns Hopkins University, Baltimore, MD
Charles Meneveau , Johns Hopkins University, Baltimore, MD
Yi Li , Johns Hopkins University, Baltimore, MD
pp. 1-11

Parallel hierarchical visualization of large time-varying 3D vector fields (Abstract)

Chaoli Wang , University of California at Davis
Hongfeng Yu , University of California at Davis
Kwan-Liu Ma , University of California at Davis
pp. 1-12

Low-constant parallel algorithms for finite element simulations using linear octrees (Abstract)

Hari Sundar , University of Pennsylvania, Philadelphia, PA
Santi S. Adavani , University of Pennsylvania, Philadelphia, PA
George Biros , University of Pennsylvania, Philadelphia, PA
Rahul S. Sampath , University of Pennsylvania, Philadelphia, PA
Christos Davatzikos , University of Pennsylvania, Philadelphia, PA
pp. 1-12

Noncontiguous locking techniques for parallel file systems (Abstract)

Alok Choudhary , Northwestern University, Evanston, Illinois
Lee Ward , Sandia National Laboratories, Albuquerque, NM
Robert Ross , Argonne National Laboratory, Argonne, IL
Avery Ching , Northwestern University, Evanston, Illinois
Wei-keng Liao , Northwestern University, Evanston, Illinois
pp. 1-12

Integrating parallel file systems with object-based storage devices (Abstract)

Ananth Devulapalli , Ohio Supercomputer Center
P. Sadayappan , The Ohio State University
Nawab Ali , The Ohio State University
Dennis Dalessandro , Ohio Supercomputer Center
Pete Wyckoff , Ohio Supercomputer Center
pp. 1-10

Evaluation of active storage strategies for the Lustre parallel file system (Abstract)

Jarek Nieplocha , Pacific Northwest National Laboratory, Richland, WA
Evan J. Felix , Pacific Northwest National Laboratory, Richland, WA
Juan Piernas , Pacific Northwest National Laboratory, Richland, WA
pp. 1-10

The ghost in the machine: observing the effects of kernel operation on parallel application performance (Abstract)

Allen D. Malony , University of Oregon, Eugene, OR
Pete Beckman , National Lab
Matthew Sottile , Los Alamos National Lab
Aroon Nataraj , University of Oregon, Eugene, OR
Alan Morris , University of Oregon, Eugene, OR
pp. 1-12

PNMPI tools: a whole lot greater than the sum of their parts (Abstract)

Martin Schulz , Lawrence Livermore National Laboratory, Livermore, CA
Bronis R. de Supinski , Lawrence Livermore National Laboratory, Livermore, CA
pp. 1-10

Multi-threading and one-sided communication in parallel LU factorization (Abstract)

Parry Husbands , Lawrence Berkeley National Laboratory, Berkeley, CA
Katherine Yelick , University of California at Berkeley, Berkeley, CA
pp. 1-10

Workstation capacity tuning using reinforcement learning (Abstract)

Ran Gilad-Bachrach , Intel Research Israel
Liat Ein-Dor , Intel Research Israel
Amir Di-Nur , Intel Inc.
Aharon Bar-Hillel , Intel Research Israel
Yossi Ittach , Intel Research Israel
pp. 1-11

Anomaly detection and diagnosis in grid environments (Abstract)

Jennifer M. Schopf , Argonne National Laboratory, Argonne, IL
Lingyun Yang , University of Chicago, Chicago, IL
Chuang Liu , Microsoft, Redmond, WA
Ian Foster , University of Chicago, Chicago, IL and Argonne National Laboratory, Argonne, IL
pp. 1-9

User-friendly and reliable grid computing based on imperfect middleware (Abstract)

Rob V. van Nieuwpoort , Vrije Universiteit Amsterdam, Amsterdam, The Netherlands
Thilo Kielmann , Vrije Universiteit Amsterdam, Amsterdam, The Netherlands
Henri E. Bal , Vrije Universiteit Amsterdam, Amsterdam, The Netherlands
pp. 1-11

Analyzing the impact of supporting out-of-order communication on in-order performance with iWARP (Abstract)

W. Feng , Virginia Tech
D. K. Panda , Ohio State University
P. Balaji , Argonne National Laboratory
W. Gropp , Argonne National Laboratory
S. Bhagvat , Dell Inc.
R. Thakur , Argonne National Laboratory
pp. 1-12

Evaluating NIC hardware requirements to achieve high message rate PGAS support on multi-core processors (Abstract)

Michael J. Levenhagen , Sandia National Laboratories, Albuquerque, NM
Keith D. Underwood , Sandia National Laboratories, Albuquerque, NM
Ron Brightwell , Sandia National Laboratories, Albuquerque, NM
pp. 1-10

High-performance Ethernet-based communications for future multi-core processors (Abstract)

Michael Schlansker , Hewlett-Packard Labs/Advanced Architecture Lab
Dennis Bradford , Intel Corporation/Corporate Technology Group
Erwin Oertli , VMware
Nathan Binkert , Hewlett-Packard Labs/Advanced Architecture Lab
Richard J. Carter , Hewlett-Packard Labs/Advanced Architecture Lab
Nagabhushan Chitlur , Intel Corporation/Corporate Technology Group
Linda Rankin , Intel Corporation/Corporate Technology Group
Paul M. Stillwell , Intel Corporation/Corporate Technology Group
Jayaram Mudigonda , Hewlett-Packard Labs/Advanced Architecture Lab
Norman P. Jouppi , Hewlett-Packard Labs/Advanced Architecture Lab
pp. 1-12

Optimization of sparse matrix-vector multiplication on emerging multicore platforms (Abstract)

Leonid Oliker , Lawrence Berkeley National Laboratory, Berkeley, CA
James Demmel , University of California at Berkeley, Berkeley, CA
Katherine Yelick , Lawrence Berkeley National Laboratory, Berkeley, CA and University of California at Berkeley, Berkeley, CA
Richard Vuduc , Lawrence Livermore National Laboratory, Livermore, CA
John Shalf , Lawrence Berkeley National Laboratory, Berkeley, CA
Samuel Williams , Lawrence Berkeley National Laboratory, Berkeley, CA and University of California at Berkeley, Berkeley, CA
pp. 1-12

Cray XT4: an early evaluation for petascale scientific simulation (Abstract)

Sadaf R. Alam , Oak Ridge National Laboratory, Oak Ridge, Tennessee
Ramanan Sankaran , Oak Ridge National Laboratory, Oak Ridge, Tennessee
Patrick H. Worley , Oak Ridge National Laboratory, Oak Ridge, Tennessee
Jeff M. Larkin , Cray Inc., Seattle, Washington
Richard F. Barrett , Oak Ridge National Laboratory, Oak Ridge, Tennessee
Mark R. Fahey , Oak Ridge National Laboratory, Oak Ridge, Tennessee
Jeffery A. Kuehn , Oak Ridge National Laboratory, Oak Ridge, Tennessee
pp. 1-12

An adaptive mesh refinement benchmark for modern parallel programming languages (Abstract)

Phillip Colella , Lawrence Berkeley National Laboratory, Berkeley, CA
Tong Wen , IBM T. J. Watson Research Center, Hawthorne, NY
Jimmy Su , University of California, Berkeley, CA
Katherine Yelick , University of California, Berkeley, CA
Noel Keen , Lawrence Berkeley National Laboratory, Berkeley, CA
pp. 1-12

Exploring event correlation for failure prediction in coalitions of clusters (Abstract)

Cheng-Zhong Xu , Wayne State University, Detroit, MI
Song Fu , Wayne State University, Detroit, MI
pp. 1-12

Advanced data flow support for scientific grid workflow applications (Abstract)

Thomas Fahringer , University of Innsbruck, Innsbruck, Austria
Jun Qin , University of Innsbruck, Innsbruck, Austria
pp. 1-12

Falkon: a Fast and Light-weight tasK executiON framework (Abstract)

Ian Foster , University of Chicago and Argonne National Laboratory, Argonne, IL
Ioan Raicu , University of Chicago, IL
Mike Wilde , University of Chicago and Argonne National Laboratory, Argonne, IL
Catalin Dumitrescu , University of Chicago, IL
Yong Zhao , University of Chicago, IL
pp. 1-12

RobuSTore: a distributed storage architecture with robust and high performance (Abstract)

Huaxia Xia , University of California, San Diego, La Jolla, CA
Andrew A. Chien , University of California, San Diego, La Jolla, CA
pp. 1-11

A user-level secure grid file system (Abstract)

Ming Zhao , University of Florida
Renato J. Figueiredo , University of Florida
pp. 1-11

Efficient gather and scatter operations on graphics processors (Abstract)

Qiong Luo , Hong Kong Univ. of Science and Technology
Bingsheng He , Hong Kong Univ. of Science and Technology
Burton Smith , Microsoft Corp.
Naga K. Govindaraju , Microsoft Corp.
pp. 1-12

A genetic algorithms approach to modeling the performance of memory-bound computations (Abstract)

Laura Carrington , San Diego Supercomputer Center, La Jolla, CA
Erich Strohmaier , Lawrence Berkeley National Laboratory, One Cyclotron Road, CA
Allan Snavely , San Diego Supercomputer Center, La Jolla, CA
Mustafa M. Tikir , San Diego Supercomputer Center, La Jolla, CA
pp. 1-12

Performance under failures of high-end computing (Abstract)

Xian-He Sun , Illinois Institute of Technology, Chicago, Illinois and Fermi National Accelerator Laboratory, Batavia, Illinois
Hui Jin , Illinois Institute of Technology, Chicago, Illinois
Ming Wu , Illinois Institute of Technology, Chicago, Illinois
pp. 1-11

Bounding energy consumption in large-scale MPI programs (Abstract)

Martin Schulz , Lawrence Livermore National Laboratory, Livermore, CA
Bronis R. de Supinski , Lawrence Livermore National Laboratory, Livermore, CA
Shelby Funk , University of Georgia, Athens, GA
David K. Lowenthal , University of Georgia, Athens, GA
Barry Rountree , University of Georgia, Athens, GA
Vincent W. Freeh , North Carolina State University, Raleigh, NC
pp. 1-9

Application development on hybrid systems (Abstract)

Patrick Crowley , Washington University, St. Louis, Missouri
Saurabh Gayen , Washington University, St. Louis, Missouri
Eric J. Tyson , Washington University, St. Louis, Missouri
Roger D. Chamberlain , Washington University, St. Louis, Missouri
Jeremy Buhler , Washington University, St. Louis, Missouri
Mark A. Franklin , Washington University, St. Louis, Missouri
James H. Buckley , Washington University, St. Louis, Missouri
pp. 1-10

Multi-level tiling: M for the price of one (Abstract)

Sanjay Rajopadhye , Colorado State University, Fort Collins, Colorado
Michelle Mills Strout , Colorado State University, Fort Collins, Colorado
Lakshminarayanan Renganarayanan , Colorado State University, Fort Collins, Colorado
Dave Rostron , Colorado State University, Fort Collins, Colorado
DaeGon Kim , Colorado State University, Fort Collins, Colorado
pp. 1-12

Implementation and performance analysis of non-blocking collective operations for MPI (Abstract)

Wolfgang Rehm , Chemnitz University of Technology, Chemnitz, Germany
Torsten Hoefler , Indiana University, Bloomington, IN
Andrew Lumsdaine , Indiana University, Bloomington, IN
pp. 1-10

Efficient operating system scheduling for performance-asymmetric multi-core architectures (Abstract)

David A. Koufaty , Intel Corporation
Scott Hahn , Intel Corporation
Dan Baumberger , Intel Corporation
Tong Li , Intel Corporation
pp. 1-11

A job scheduling framework for large computing farms (Abstract)

Marco Pasquali , Information Science and Technologies Institute, Pisa, Italy
Ranieri Baraglia , Information Science and Technologies Institute, Pisa, Italy
Gabriele Capannini , Information Science and Technologies Institute, Pisa, Italy
Laura Ricci , Largo B. Pontecorvo, Pisa, Italy
Diego Puppin , Information Science and Technologies Institute, Pisa, Italy
pp. 1-10

Optimizing center performance through coordinated data staging, scheduling and recovery (Abstract)

Gregory G. Pike , Oak Ridge National Laboratory
John W. Cobb , Oak Ridge National Laboratory
Zhe Zhang , North Carolina State University
Frank Mueller , North Carolina State University
Sudharshan S. Vazhkudai , Oak Ridge National Laboratory
Xiaosong Ma , North Carolina State University and Oak Ridge National Laboratory
Chao Wang , North Carolina State University
pp. 1-11

A 281 Tflops calculation for X-ray protein structure analysis with special-purpose computers MDGRAPE-3 (Abstract)

Ryutaro Himeno , Nagoya University, Keio University and University of Fukui
Tetsu Narumi , Nagoya University, Keio University and University of Fukui
Takahiro Koishi , Nagoya University, Keio University and University of Fukui
Makoto Taiji , Nagoya University, Keio University and University of Fukui
Hideo Ago , Nagoya University, Keio University and University of Fukui
Eiji Nishibori , Nagoya University, Keio University and University of Fukui
Makoto Sakata , Nagoya University, Keio University and University of Fukui
Tahir H. Tahirov , Nagoya University, Keio University and University of Fukui
Masashi Miyano , Nagoya University, Keio University and University of Fukui
Toshikazu Ebisuzaki , Nagoya University, Keio University and University of Fukui
Yousuke Ohno , Nagoya University, Keio University and University of Fukui
pp. 1-10

First-principles calculations of large-scale semiconductor systems on the Earth Simulator (Abstract)

Takenori Yamamoto , Toho University, Funabashi, Chiba, Japan
Takahisa Ohno , Material Science Center (NIMS-CMSC), Tsukuba, Ibaraki, Japan
Daisuke Fukata , NEC Soft, Ltd., Koto, Tokyo, Japan
Akira Azami , NEC Informatec Systems, Ltd., Takatsu, Kawasaki, Japan
Yuta Sakaguchi , Advanced Soft Engineering, Inc., Chuo, Tokyo, Japan
Tatsunobu Kokubo , NEC Corporation, Fuchu, Tokyo, Japan
Junichiro Koga , AdvanceSoft Corporation, Minato, Tokyo, Japan
Tsuyoshi Uda , AdvanceSoft Corporation, Minato, Tokyo, Japan
Takahiro Yamasaki , University of Tokyo, Tokyo, Japan
pp. 1-6

Extending stability beyond CPU millennium: a micron-scale atomistic simulation of Kelvin-Helmholtz instability (Abstract)

K. J. Caspersen , Lawrence Livermore National Laboratory, Livermore, CA
D. F. Richards , Lawrence Livermore National Laboratory, Livermore, CA
J. A. Gunnels , IBM Corporation, Yorktown Heights, New York
J. N. Glosli , Lawrence Livermore National Laboratory, Livermore, CA
F. H. Streitz , Lawrence Livermore National Laboratory, Livermore, CA
R. E. Rudd , Lawrence Livermore National Laboratory, Livermore, CA
pp. 1-11

WRF nature run (Abstract)

Michael O. McCracken , PMaC Laboratory San Diego Supercomputer Center, La Jolla, CA
Robert Walkup , IBM Thomas J. Watson Research Center, Yorktown Heights, NY
Nicholas J. Wright , PMaC Laboratory San Diego Supercomputer Center, La Jolla, CA
Brent Gorda , Lawrence Livermore National Laboratory, Livermore, CA
Josh Hacker , University Corporation for Atmospheric Research (UCAR), Boulder, CO
Tom Spelce , Lawrence Livermore National Laboratory, Livermore, CA
John Michalakes , University Corporation for Atmospheric Research (UCAR), Boulder, CO
Richard Loft , University Corporation for Atmospheric Research (UCAR), Boulder, CO
Allan Snavely , PMaC Laboratory San Diego Supercomputer Center, La Jolla, CA
pp. 1-6
Back Matter

Back Matter (PDF)

pp. z-z17