The Community for Technology Leaders
Parallel and Distributed Processing Symposium, International (2004)
Santa Fe, New Mexico
Apr. 26, 2004 to Apr. 30, 2004
ISBN: 0-7695-2132-0
TABLE OF CONTENTS
Session 1: Scheduling and Mapping

Scheduling of Query Execution Plans in Symmetric Multiprocessor Database Systems (Abstract)

Chih-wen Hsueh , National Chung Cheng University
Jian-Jia Chen , National Taiwan University
Jun Wu , National Chung Cheng University
Tei-Wei Kuo , National Taiwan University
pp. 2b

A Novel Static Task Scheduling Algorithm in Distributed Computing Environments (Abstract)

Jian-Jun Han , Huzhong University of Science and Technology
Qing-Hua Li , Huzhong University of Science and Technology
pp. 3a

Assignment of Shortest Paths Spanning Trees in Meshes (Abstract)

Christian Destré , LaMI, CNRS-Université déEvry Val déEssonne
Christian Laforest , LaMI, CNRS-Université déEvry Val déEssonne
Sandrine Vial , LaMI, CNRS-Université déEvry Val déEssonne
pp. 4a

Parallel Maximum Weight Bipartite Matching Algorithms for Scheduling in Input-Queued Switches (Abstract)

David Kaeli , Northeastern University
Waleed Meleis , Northeastern University
Morteza Fayyazi , Northeastern University
pp. 4b
Session 2: Scientific Applications I

Performance Characteristics of the Multi-Zone NAS Parallel Benchmarks (Abstract)

Haoqiang Jin , NASA Ames Research Center
Rob F. Van der Wijngaart , NASA Ames Research Center
pp. 6b

A Parallel Object-Oriented Application for 3D Electromagnetism (Abstract)

Said El Kasmi , INRIA Sophia Antipolis, CNRS - I3S - Université Nice Sophia Antipolis
Stéphane Lanteri , INRIA Sophia Antipolis, CNRS - I3S - Université Nice Sophia Antipolis
Laurent Baduel , INRIA Sophia Antipolis, CNRS - I3S - Université Nice Sophia Antipolis
Françoise Baude , INRIA Sophia Antipolis, CNRS - I3S - Université Nice Sophia Antipolis
Christian Delbé , INRIA Sophia Antipolis, CNRS - I3S - Université Nice Sophia Antipolis
Nicolas Gama , INRIA Sophia Antipolis, CNRS - I3S - Université Nice Sophia Antipolis
Denis Caromel , INRIA Sophia Antipolis, CNRS - I3S - Université Nice Sophia Antipolis
pp. 7b

Solving Large Sparse Linear Systems in End-to-end Accelerator Structure Simulations (Abstract)

Michael Wolf , Stanford Linear Accelerator Center
Greg Schussman , Stanford Linear Accelerator Center
Kwok Ko , Stanford Linear Accelerator Center
Marc Kowalski , Stanford Linear Accelerator Center
Zenghai Li , Stanford Linear Accelerator Center
Lie-Quan Lee , Stanford Linear Accelerator Center
Cho-Kuen Ng , Stanford Linear Accelerator Center
Lixin Ge , Stanford Linear Accelerator Center
pp. 8a

Optimization of the POLCOMS Hydrodynamic Code for Terascale High-Performance Computers (Abstract)

J. T. Holt , Bidston Observatory
R. Proctor , Bidston Observatory
M. Ashworth , CCLRC Daresbury Laboratory
pp. 8b
Session 3: Interconnection Networks

BLACK-BUS: A New Data-Transfer Technique Using Local Address on Networks-on-Chips (Abstract)

Hideharu Amano , Keio University
Kenichiro Anjo , Keio University
Yutaka Yamada , Keio University
Akiya Jouraku , Keio University
Michihiro Koibuchi , Keio University
pp. 10a

Fast and Scalable MPI-Level Broadcast Using InfiniBand's Hardware Multicast Support (Abstract)

Amith R Mamidala , Ohio State University
Dhabaleswar K Panda , Ohio State University
Jiuxing Liu , Ohio State University
pp. 10b

A Multiple LID Routing Scheme for Fat-Tree-Based InfiniBand Networks (Abstract)

Xuan-Yi Lin , National Tsing-Hua University
Tai-Yi Huang , National Tsing-Hua University
Yeh-Ching Chung , National Tsing-Hua University
pp. 11a

On Constructing the Minimum Orthogonal Convex Polygon in 2-D Faulty Meshes (Abstract)

Jie Wu , Florida Atlantic University
Zhen Jiang , West Chester University
pp. 12a

LORE — Local Reconfiguration for Fault Management in Irregular Interconnects (Abstract)

Ingebjørg Theiss , Simula Research Laboratory
Olav Lysne , Simula Research Laboratory
pp. 12b
Session 4: Parallel Programming Models/Implementations

High Performance Implementation of MPI Derived Datatype Communication over InfiniBand (Abstract)

Jiesheng Wu , Ohio State University
Dhabaleswar Panda , Ohio State University
Pete Wyckoff , Ohio Supercomputer Center
pp. 14a

Performance Comparison of Pure MPI vs Hybrid MPI-OpenMP Parallelization Models on SMP Clusters (Abstract)

Nikolaos Drosinos , National Technical University of Athens
Nectarios Koziris , National Technical University of Athens
pp. 15a

Architecture of LA-MPI, A Network-Fault-Tolerant MPI (Abstract)

David J. Daniel , Los Alamos National Laboratory
Rob T. Aulwes , Los Alamos National Laboratory
Timothy S. Woodall , Los Alamos National Laboratory
L. Dean Risinger , Los Alamos National Laboratory
Mitchel W. Sukalski , Sandia National Laboratories
Mark A. Taylor , Los Alamos National Laboratory
Nehal N. Desai , Los Alamos National Laboratory
Richard L. Graham , Los Alamos National Laboratory
pp. 15b

The UPC Memory Model: Problems and Prospects (Abstract)

Charles Wallace , Michigan Technological University
William Kuchera , Michigan Technological University
pp. 16a

Design and Implementation of MPICH2 over InfiniBand with RDMA Support (Abstract)

William Gropp , Argonne National Laboratory
Darius Buntinas , Argonne National Laboratory
Jiuxing Liu , Ohio State University
Brian Toonen , Argonne National Laboratory
Pete Wyckoff , Ohio Supercomputer Center
Dhabaleswar K. Panda , Ohio State University
Weihang Jiang , Ohio State University
David Ashton , Argonne National Laboratory
pp. 16b
Session 5: Network Algorithms

Optimal Multi-Channel Data Allocation with Flat Broadcast Per Channel (Abstract)

S. Ramaprasad , Brown University
A. A. Bertossi , University of Bologna
M. V. S. Shashanka , Boston University
M. C. Pinotti , University of Perugia
R. Rizzi , University of Trento
pp. 18b

On the IP Routing Tables Minimization with Addresses Reassignments (Abstract)

Vittorio Bilò , Università di LéAquila
Michele Flammini , Università di LéAquila
pp. 19a

Pipelining Broadcasts on Heterogeneous Platforms (Abstract)

O. Beaumont , LaBRI, UMR CNRS
A. Legrand , LIP, UMR CNRS-INRIA
Y. Robert , LIP, UMR CNRS-INRIA
L. Marchal , LIP, UMR CNRS-INRIA
pp. 19b

Towards Efficient Load Balancing in Structured P2P Systems (Abstract)

Yingwu Zhu , University of Cincinnati
Yiming Hu , University of Cincinnati
pp. 20a

Load Balancing: Dimension Exchange on Product Graphs (Abstract)

Holger Arndt , University of Wuppertal
pp. 20b
Session 6: Grid Applications and Sensor Networks

Single Sign-On in In-VIGO: Role-Based Access via Delegation Mechanisms Using Short-Lived User Identities (Abstract)

Maurício Tsugawa , University of Florida
José A. B. Fortes , University of Florida
Andréa Matsunaga , University of Florida
Renato Figueiredo , University of Florida
Sumalatha Adabala , University of Florida
pp. 22b

A Cluster Oriented Model for Dynamically Balanced DHTs (Abstract)

António Pina , University of Minho
José Rufino , Polytechnic Institute of Bragança
Albano Alves , Polytechnic Institute of Bragança
José Exposto , Polytechnic Institute of Bragança
pp. 23a

Policy Based Scheduling for Simple Quality of Service in Grid Computing (Abstract)

Richard Cavanaugh , University of Florida
Paul Avery , University of Florida
Jang-uk In , University of Florida
Sanjay Ranka , University of Florida
pp. 23b

A New Algorithm for Relative Localization in Wireless Sensor Networks (Abstract)

Yi Shang , University of Missouri-Columbia
Hongchi Shi , University of Missouri-Columbia
Jing Meng , University of Missouri-Columbia
pp. 24a

Malicious Node Detection in Wireless Sensor Networks (Abstract)

Waldir Ribeiro Pires Júnior , Universidade Federal de Minas Gerais
Antonio A. F. Loureiro , Universidade Federal de Minas Gerais
Hao Chi Wong , Universidade Federal de Minas Gerais
Thiago H. de Paula Figueiredo , Universidade Federal de Minas Gerais
pp. 24b
Session 7: Distributed System Architecture

Cycloid: A Constant-Degree and Lookup-Efficient P2P Overlay Network (Abstract)

Cheng-Zhong Xu , Wayne State University
Guihai Chen , Nanjing University
Haiying Shen , Wayne State University
pp. 26a

Characterizing and Evaluating Desktop Grids: An Empirical Study (Abstract)

Charles L. Brooks III , The Scripps Research Institute
Henri Casanova , University of California at San Diego and San Diego Supercomputer Center
Michela Taufer , University of California at San Diego and San Diego Supercomputer Center
Derrick Kondo , University of California at San Diego
Andrew A. Chien , University of California at San Diego
pp. 26b

Distributed Embedded Systems for Low Power: A Case Study (Abstract)

Jinfeng Liu , University of California at Irvine
Pai H. Chou , University of California at Irvine
pp. 27a

How to Run Experiments with Large Peer-to-Peer Data Structures (Abstract)

Klemens Böhm , Otto-von-Guericke Universität
Erik Buchmann , Otto-von-Guericke Universität
pp. 27b

Mobility-Sensitive Topology Control in Mobile Ad Hoc Networks (Abstract)

Jie Wu , Florida Atlantic University
Fei Dai , Florida Atlantic University
pp. 28a
Session 8: Shared Memory Operations/Optimizations/Models

Adaptive Memory Paging for Efficient Gang Scheduling of Parallel Applications (Abstract)

Nimish Pachapurkar , Arizona State University
Liana L. Fong , IBM T.J. Watson Research Center
Kyung Dong Ryu , Arizona State University
pp. 30a

Integrating Remote Invocation and Distributed Shared State (Abstract)

Sandhya Dwarkadas , University of Rochester
Michael L. Scott , University of Rochester
Chunqiang Tang , University of Rochester
DeQing Chen , University of Rochester
pp. 30b

Host-Assisted Zero-Copy Remote Memory Access Communication on InfiniBand (Abstract)

D. K. Panda , Ohio State University
J. Nieplocha , Pacific Northwest National Laboratory
V. Tipparaju , Pacific Northwest National Laboratory
G. Santhanaraman , Ohio State University
pp. 31a

Nemos: A Framework for Axiomatic and Executable Specifications of Memory Consistency Models (Abstract)

Yue Yang , University of Utah
Konrad Slind , University of Utah
Gary Lindstrom , University of Utah
Ganesh Gopalakrishnan , University of Utah
pp. 31b
Session 9: Plenary Session: Best Papers

Translating Submachine Locality into Locality of Reference (Abstract)

Geppino Pucci , University of Padova
Andrea Pietracaprina , University of Padova
Carlo Fantozzi , University of Padova
pp. 34a

Efficient Synthesis of Out-of-Core Algorithms Using a Nonlinear Optimization Solver (Abstract)

Chi-Chung Lam , Ohio State University
J. Ramanujam , Louisiana State University
Venkatesh Choppella , Indian Institute of Information Technology and Management
P. Sadayappan , Ohio State University
Gerald Baumgartner , Ohio State University
Sriram Krishnamoorthy , Ohio State University
Sandhya Krishnan , Ohio State University
pp. 34b

Designing WDM Optical Interconnects with Full Connectivity by Using Limited Wavelength Conversion (Abstract)

Jianchao Wang , East Isle Technologies Inc.
Yuanyuan Yang , State University of New York at Stony Brook
pp. 35a

Running OpenMP Applications Efficiently on an Everything-Shared SDSM (Abstract)

T. Cortes , Universitat Politècnica de Catalunya - Barcelona
J. J. Costa , Universitat Politècnica de Catalunya - Barcelona
J. Labarta , Universitat Politècnica de Catalunya - Barcelona
X. Martorell , Universitat Politècnica de Catalunya - Barcelona
E. Ayguade , Universitat Politècnica de Catalunya - Barcelona
pp. 35b
Session 10: Parallel Algorithms for Graphs and Multiprocessors

A Fast, Parallel Spanning Tree Algorithm for Symmetric Multiprocessors (Abstract)

David A. Bader , University of New Mexico
Guojing Cong , University of New Mexico
pp. 38a

Fast and Scalable Parallel Algorithms for Euclidean Distance Transform on LARPBS (Abstract)

Ling Chen , Yangzhou University and Nanjing University
Xiao-hua Xu , Yangzhou University
Yi Pan , Georgia State University
pp. 38b

Optimising Static Workload Allocation in Multiclusters (Abstract)

Graham R. Nudd , University of Warwick
Daniel P. Spooner , University of Warwick
Ligang He , University of Warwick
Stephen A. Jarvis , University of Warwick
pp. 39b

A Multiprocessor Implementation of the Total Bandwidth Server (Abstract)

Sanjoy Baruah , University of North Carolina
Giuseppe Lipari , Scuola Superiore S. Anna
pp. 40a

An Algorithm for Geometric Load Balancing with Two Constraints (Abstract)

Marios Papaefthymiou , University of Michigan
Jiyoun Kim , University of Michigan
Athar Tayyab , IBM Microelectronics
pp. 40b
Session 11: Scientific Applications II

A Large Scale Monte Carlo Simulator for Cellular Microphysiology (Abstract)

Thomas M. Bartol , The Salk Institute
Scott B. Baden , University of California at San Diego
Gregory T. Balls , University of California at San Diego
Tilman Kispersky , The Salk Institute
Terrence J. Sejnowski , The Salk Institute
pp. 42a

A Hierarchical Parallel Scheme for Global Parameter Estimation in Systems Biology (Abstract)

J. He , Virginia Polytechnic Institute and State University
J. W. Zwolak , Virginia Polytechnic Institute and State University
J. J. Tyson , Virginia Polytechnic Institute and State University
M. Sosonkina , Iowa State University
L. T. Watson , Virginia Polytechnic Institute and State University
C. A. Shaffer , Virginia Polytechnic Institute and State University
pp. 42b

Parallel Simulation of Fluid Slip in a Microchannel (Abstract)

Jingyu Zhou , University of California at Santa Barbara
Luoding Zhu , University of California at Santa Barbara
Linda Petzold , University of California at Santa Barbara
Tao Yang , University of California at Santa Barbara
pp. 43a

Parallel Brutus: The First Distributed, FPGA Accelerated Chess Program (Abstract)

Ulf Lorenz , University of Paderborn
Alex Kure , University of Paderborn
Chrilly Donninger , University of Paderborn
pp. 44b
Session 12: Distributed Memory and Networks

Taking Advantage of the Overlay Geometrical Structures for Mobile Agent Communications (Abstract)

Amit Banerjee , National Tsing-Hua University
Chung-Ta King , National Tsing-Hua University
Po-Sheng Huang , National Tsing-Hua University
Hung-Chang Hsiao , National Tsing-Hua University
pp. 46a

Building a Scalable Bipartite P2P Overlay Network (Abstract)

Yunhao Liu , Michigan State University
Li Xiao , Michigan State University
Lionel M. Ni , Hong Kong University of Science & Technology
pp. 46b

Specification and Architecture Supports for Component Adaptations on Distributed Environments (Abstract)

Chung-Kai Chen , National Tsing Hua University
Jenq-Kuen Lee , National Tsing Hua University
Cheng-Wei Chen , National Tsing Hua University
pp. 47a

Integrating Program Component Executables on Distributed Memory Architectures via MPH (Abstract)

Yun He , Lawrence Berkeley National Laboratory
Chris Ding , Lawrence Berkeley National Laboratory
pp. 47b

Hierarchical Routing with Soft-State Replicas in TerraDir (Abstract)

Bujor Silaghi , University of Maryland at College Park
Pete Keleher , University of Maryland at College Park
Vijay Gopalakrishnan , University of Maryland at College Park
Bobby Bhattacharjee , University of Maryland at College Park
pp. 48a

Application-Perceived Multicast Push Performance (Abstract)

Vincenzo Liberatore , Case Western Reserve University
Wenhui Zhang , Case Western Reserve University
Wei Li , Case Western Reserve University
pp. 48b
Session 13: Distributed Algorithms and Data Structures

Almost Wait-Free Resizable Hashtables (Abstract)

J. F. Groote , Eindhoven University of Technology
W. H. Hesselink , University of Groningen
H. Gao , University of Groningen
pp. 50a

Star-Coloring of Graphs for Conflict-Free Access to Parallel Memory Systems (Abstract)

Rossella Petreschi , University of Rome "La Sapienza"
Irene Finocchi , University of Rome "Tor Vergata"
Sajal Das , University of Texas at Arlington
pp. 50b

A Distributed Hash Table for Computational Grids (Abstract)

Chris Riley , Johns Hopkins University
Christian Scheideler , Johns Hopkins University
pp. 51a

An Efficient Distributed Mutual Exclusion Algorithm Based on Relative Consensus Voting (Abstract)

Daoxu Chen , Nanjing University
Jingyang Zhou , Nanjing University
Jie Wu , Florida Atlantic University
Jiannong Cao , Hong Kong Polytechnic University
pp. 51b

Distributed Adaptive Task Allocation in Heterogeneous Computing Environments to Maximize Throughput (Abstract)

Viktor K. Prasanna , University of Southern California
Bo Hong , University of Southern California
pp. 52b
Session 14: P2P and Networking Applications

A Neural Network Based Approach for Overlay Multicast in Media Streaming Systems (Abstract)

I-Ling Yen , University of Texas at Dallas
Peng Li , University of Texas at Dallas
Zhonghang Xia , University of Texas at Dallas
pp. 54a

Secure and Reliable Decentralized Peer-to-Peer Web Cache (Abstract)

Bo Sheng , University of Texas at Dallas
Farokh B. Bastani , University of Texas at Dallas
pp. 54b

Exploiting Client Cache: A Scalable and Efficient Approach to Build Large Web Cache (Abstract)

Yiming Hu , University of Cincinnati
Laxmi Bhuyan , University of California at Riverside
Zhiyong Xu , University of California at Riverside
pp. 55a

Diagnostics for Causes of Packet Loss in a High Performance Data Transfer System (Abstract)

Phillip M. Dickens , Illinois Institute of Technology and Argonne National Laboratory
Jay W. Larson , Argonne National Laboratory
David M. Nicol , University of Illinois Urbana-Champaign
pp. 55b

Prediction-Based Routing through Least Cost Delay Constraint (Abstract)

Afshin Shiravi , Washington University in St. Louis
Yoon G. Kim , Washington University in St. Louis
Paul S. Min , Washington University in St. Louis
pp. 56a

A SNAP-Based Community Resource Broker Using a Three-Phase Commit Protocol (Abstract)

Iain Gourlay , University of Leeds
Mohammed H. Haji , University of Leeds
Peter M. Dew , University of Leeds
Karim Djemame , University of Leeds
pp. 56b
Session 15: Parallel System Architecture

Highly Efficient Synchronization Based on Active Memory Operations (Abstract)

Lixin Zhang , IBM Austin Research Lab
John B. Carter , University of Utah
Zhen Fang , University of Utah
pp. 58a

On the Feasibility of Incremental Checkpointing for Scientific Computing (Abstract)

Eitan Frachtenberg , Los Alamos National Laboratory
Greg Johnson , Los Alamos National Laboratory
Juan Fernández , Los Alamos National Laboratory
José Carlos Sancho , Los Alamos National Laboratory
Fabrizio Petrini , Los Alamos National Laboratory
pp. 58b

Multithreaded Home-Based Lazy Release Consistency over VIA (Abstract)

Assaf Schuster , Technion-Israel Institute of Technology
Vadim Iosevich , Technion-Israel Institute of Technology
pp. 59b

A Novel Method for Adding Multiprocessor Support to a Large and Complex Uniprocessor Kernel (Abstract)

Simon Kågström , Blekinge Institute of Technology
Lars Lundberg , Blekinge Institute of Technology
Håkan Grahn , Blekinge Institute of Technology
pp. 60a

Assignment and Scheduling of Real-time DSP Applications for Heterogeneous Functional Units (Abstract)

Meilin Liu , University of Texas at Dallas
Qingfeng Zhuge , University of Texas at Dallas
Zili Shao , University of Texas at Dallas
Edwin H.-M. Sha , University of Texas at Dallas
Chun Xue , University of Texas at Dallas
Yi He , University of Texas at Dallas
pp. 60b
Session 16: Thread/Job Scheduling, Load Balancing and Management

Unobtrusiveness and Efficiency in Idle Cycle Stealing for PC Grids (Abstract)

Kyung Dong Ryu , Arizona State University
Jeffrey K. Hollingsworth , University of Maryland at College Park
pp. 62a

Packet Probing as Network Load Detection for Scientific Applications at Run-Time (Abstract)

Masha Sosonkina , Iowa State University
Sam Storie , University of Minnesota Duluth
pp. 62b

Queue Scheduling and Advance Reservations with COSY (Abstract)

Falk Zimmermann , NEC Europe Ltd.
Junwei Cao , NEC Europe Ltd.
pp. 63a

Towards Efficient Multi-Level Threading of H.264 Encoder on Intel Hyper-Threading Architectures (Abstract)

Steven Ge , Intel Corporation
Yen-Kuang Chen , Intel Corporation
Xinmin Tian , Intel Corporation
Milind Girkar , Intel Corporation
pp. 63b

Fault-Aware Job Scheduling for BlueGene/L Systems (Abstract)

A. J. Oliner , Massachusetts Institute of Technology
M. Gupta , IBM T.J. Watson Research Center
R. K. Sahoo , IBM T.J. Watson Research Center
A. Sivasubramaniam , Pennsylvania State University
J. E. Moreira , IBM T.J. Watson Research Center
pp. 64a
Session 17: Distributed and Mobile Computing

Randomized Smoothing Networks (Abstract)

Srikanta Tirthapura , Iowa State University
Maurice Herlihy , Brown University
pp. 66a

Finding Satisfying Global States: All for One and One for All (Abstract)

Ranganath Atreya , University of Texas at Dallas
Alper Sen , University of Texas at Austin
Vijay K. Garg , University of Texas at Austin
Neeraj Mittal , University of Texas at Dallas
pp. 66b

Energy-Efficient Caching and Prefetching with Data Consistency in Mobile Distributed Systems (Abstract)

Sajal K. Das , University of Texas at Arlington
Huaping Shen , University of Texas at Arlington
Mohan Kumar , University of Texas at Arlington
Zhijun Wang , University of Texas at Arlington
pp. 67a

Transaction Based Dynamic Partial Replication in Mobile Environments (Abstract)

Peng Li , University of Texas at Dallas
I-Ling Yen , University of Texas at Dallas
Manghui Tu , University of Texas at Dallas
pp. 67b

Survivable Systems Based on an Adaptive NMR Algorithm (PDF)

I-Ling Yen , University of Texas at Dallas
Wei Li , University of Texas at Dallas
Farokh Bastani , University of Texas at Dallas
Ing-Ray Chen , Virginia Tech
Qing Kai Ma , University of Texas at Dallas
pp. 68a

An Optimal Protocol for Causally Consistent Distributed Shared Memory Systems (Abstract)

Roberto Baldoni , Università di Roma "La Sapienza"
Sara Tucci Piergiovanni , Università di Roma "La Sapienza"
Alessia Milani , Università di Roma "La Sapienza"
pp. 68b
Session 18: Applications

SRUMMA: A Matrix Multiplication Algorithm Suitable for Clusters and Scalable Shared Memory Systems (Abstract)

Manojkumar Krishnan , Pacific Northwest National Laboratory
Jarek Nieplocha , Pacific Northwest National Laboratory
pp. 70b

Memory-Based Scheduling for a Parallel Multifrontal Solver (Abstract)

Abdou Guermouche , École Normale Supérieure de Lyon
Jean-Yves L'Excellent , École Normale Supérieure de Lyon
pp. 71a

Adapting to Memory Pressure from within Scientific Applications on Multiprogrammed COWs (Abstract)

Dimitrios S. Nikolopoulos , College of William and Mary
Richard T. Mills , College of William and Mary
Andreas Stathopoulos , College of William and Mary
pp. 71b

Parallelization and Performance of Interactive Multiplayer Game Servers (Abstract)

Angelos Bilas , Foundation for Research and Technology
Ahmed Abdelkhalek , University of Toronto
pp. 72a

Distributed Algorithms for Partially Clairvoyant Dispatchers (Abstract)

A. Osman , West Virginia University
K. Subramani , West Virginia University
Kiran Yellajyosula , University of Minnesota
pp. 72b
Session 19: Multiprocessor and Multithreaded Architectures

DCache Warn: An I-Fetch Policy to Increase SMT Efficiency (Abstract)

Mateo Valero , Universitat Politècnica de Catalunya
Alex Ramirez , Universitat Politècnica de Catalunya
Francisco J. Cazorla , Universitat Politècnica de Catalunya
Enrique Fernández , Universidad de Las Palmas de Gran Canaria
pp. 74a

Bundling: Reducing the Overhead of Multiprocessor Prefetchers (Abstract)

Erik Hagersten , Uppsala University
Dan Wallin , Uppsala University
pp. 74b

Using Speculation to Simplify Multiprocessor Design (Abstract)

David A. Wood , University of Wisconsin-Madison
Milo M. K. Martin , University of Pennsylvania
Mark D. Hill , University of Wisconsin-Madison
Daniel J. Sorin , Duke University
pp. 75a

SPEAR: A Hybrid Model for Speculative Pre-Execution (Abstract)

Jean-Luc Gaudiot , University of California at Irvine
Won W. Ro , University of Southern California
pp. 75b

Speculation Control for Simultaneous Multithreading (Abstract)

Jean-Luc Gaudiot , University of California at Irvine
Dongsoo Kang , University of Southern California
pp. 76a

Clustered Multithreaded Architectures — Pursuing both IPC and Cycle Time (Abstract)

Dean M. Tullsen , University of California at San Diego
Jamison D. Collins , University of California at San Diego
pp. 76b
Session 20: Compilers and Tools

Ouroboros: A Tool for Building Generic, Hybrid, Divide and Conquer Algorithms (Abstract)

Ian Foster , University of Chicago and Argonne National Laboratory
John R. Johnson , University of Chicago and Lawrence Livermore National Laboratory
pp. 78a

BigSim: A Parallel Simulator for Performance Prediction of Extremely Large Parallel Machines (Abstract)

Gunavardhan Kakulapati , University of Illinois at Urbana-Champaign
Gengbin Zheng , University of Illinois at Urbana-Champaign
Laxmikant V. Kalé , University of Illinois at Urbana-Champaign
pp. 78b

Compiler Support for Parallel Code Generation through Kernel Recognition (Abstract)

Juan Touriño , University of A Coruña
Ramón Doallo , University of A Coruña
Manuel Arenaz , University of A Coruña
pp. 79b
Session 21: Dynamic, P2P and Selfish Protocols

A Game Theory Based Pricing Strategy for Job Allocation in Mobile Grids (Abstract)

Nirmalya Roy , University of Texas at Arlington
Kalyan Basu , University of Texas at Arlington
Preetam Ghosh , University of Texas at Arlington
Sajal K. Das , University of Texas at Arlington
pp. 82a

LessLog: A Logless File Replication Algorithm for Peer-to-Peer Distributed Systems (Abstract)

Kuang-Li Huang , National Tsing Hua University
Jerry C. Y. Chou , National Tsing Hua University
Tai-Yi Huang , National Tsing Hua University
pp. 82b

SAT-Match: A Self-Adaptive Topology Matching Method to Achieve Low Lookup Latency in Structured P2P Overlay Networks (Abstract)

Song Jiang , College of William and Mary
Shansi Ren , College of William and Mary
Lei Guo , College of William and Mary
Xiaodong Zhang , College of William and Mary
pp. 83a

Pareto Approximations for the Bicriteria Scheduling Problem (Abstract)

Michele Flammini , Università di LéAquila
Vittorio Bilò , Università di LéAquila
Luca Moscardelli , Università di LéAquila
pp. 83b

ABC: A Cluster-Based Protocol for Resource Location in Peer-to-Peer Systems (Abstract)

Hsu Wen Jing , Nanyang Technological University and Singapore-MIT Alliance Program
Hu Yahong , Singapore-MIT Alliance Program
Xu Xiang , Nanyang Technological University
pp. 84a

A General Model for Detecting Distributed Termination in Dynamic Systems (Abstract)

Xinli Wang , Michigan Technological University
Jean Mayo , Michigan Technological University
pp. 84b
Session 22: Data Mining

Dynamic Adjustment of Execution Order in Real-Time Databases (Abstract)

Qiang Wang , Chinese Academy of Sciences
Hongan Wang , Chinese Academy of Sciences
Guozhong Dai , Chinese Academy of Sciences
Yongyan Wang , Chinese Academy of Sciences
pp. 87a

Scaling and Parallelizing a Scientific Feature Mining Application Using a Cluster Middleware (Abstract)

Xuan Zhang , Ohio State University
Gagan Agrawal , Ohio State University
Leonid Glimcher , Ohio State University
pp. 87b

Improving Response Time in Cluster-Based Web Servers through Coscheduling (Abstract)

Chita R. Das , Pennsylvania State University
Jin-Ha Kim , Pennsylvania State University
Deniz Ersoz , Pennsylvania State University
Gyu Sang Choi , Pennsylvania State University
pp. 88a

Processing Rate Allocation for Proportional Slowdown Differentiation on Internet Servers (Abstract)

Cheng-Zhong Xu , Wayne State University
Xiaobo Zhou , University of Colorado
Jianbin Wei , Wayne State University
pp. 88b
Session 23: Special Purpose Architectures and Memory Systems

Evaluation of Elementary Functions Using Multimedia Features (Abstract)

Javier Hormigo , University of Malaga
Julio Villalba , University of Malaga
Gerardo Bandera , University of Malaga
Mario Gonzalez , University of Malaga
Emilio L. Zapata , University of Malaga
pp. 90a

Sparse Matrix Transpose Unit (Abstract)

Ben Juurlink , Delft University of Technology
Pyrrhos Stathis , Delft University of Technology
Stamatis Vassiliadis , Delft University of Technology
Dmitry Cheresiz , Delft University of Technology
pp. 90b

Processor-Embedded Distributed MEMS-Based Storage Systems for High-Performance I/O (Abstract)

Alok N. Choudhary , Northwestern University
Steve C. Chiu , Northwestern University
Wei-keng Liao , Northwestern University
pp. 91b

Scalable and Modular Algorithms for Floating-Point Matrix Multiplication on FPGAs (Abstract)

Viktor K. Prasanna , University of Southern California
Ling Zhuo , University of Southern California
pp. 92a
Session 24: Other Software

Re-Architecting Flow Control Adaptation for Grid Environments (Abstract)

Wu-chun Feng , Los Alamos National Laboratory
Mark K. Gardner , Los Alamos National Laboratory
Adam Engelhart , Los Alamos National Laboratory
pp. 94a

A Flexible IO Scheme for Grid Workflows (Abstract)

David Abramson , Monash University
Jagan Kommineni , Monash University
pp. 94b

Performance Measurement and Modeling of Component Applications in a High Performance Computing Environment: A Case Study (Abstract)

R. C. Armstrong , Sandia National Laboratories
N. Trebon , Sandia National Laboratories
S. Shende , University of Oregon
J. Ray , Sandia National Laboratories
A. Malony , University of Oregon
pp. 95b

Replication Under Scalable Hashing: A Family of Algorithms for Scalable Decentralized Data Distribution (Abstract)

R. J. Honicky , University of California at Santa Cruz
Ethan L. Miller , University of California at Santa Cruz
pp. 96a

Scalable High-level Caching for Parallel I/O (Abstract)

Kenin Coloma , Northwestern University
Neil Pundit , Sandia National Laboratories
Lee Ward , Sandia National Laboratories
Wei-keng Liao , Northwestern University
Eric Russell , Sandia National Laboratories
Alok Choudhary , Northwestern University
pp. 96b
Industrial Track
89 ms
(Ver 3.1 (10032016))