Frontiers of Massively Parallel Processing, Symposium on the (1995)
McLean, Virginia
Feb. 6, 1995 to Feb. 9, 1995
ISBN: 0-8186-6965-9
TABLE OF CONTENTS

Committees (PDF)

pp. xiii

Referees (PDF)

pp. xv
Session 1

Invited Speaker (PDF)

Session 2A: Algorithms I

Efficient parallelizations of a competitive learning algorithm for text retrieval on the MasPar (Abstract)

S.D. Lang , Dept. of Comput. Sci., Central Florida Univ., Orlando, FL, USA
Inien Syu , Dept. of Comput. Sci., Central Florida Univ., Orlando, FL, USA
K.A. Hua , Dept. of Comput. Sci., Central Florida Univ., Orlando, FL, USA
pp. 4

Parallelization of two breadth-first search-based applications using different message-passing paradigms: an experimental evaluation (Abstract)

S. Ranka , Sch. of Comput. & Inf. Sci., Syracuse Univ., NY, USA
Seungjo Bae , Sch. of Comput. & Inf. Sci., Syracuse Univ., NY, USA
pp. 12

Many-to-many personalized communication with bounded traffic (Abstract)

R.V. Shankar , Sch. of Comput. & Inf. Sci., Syracuse Univ., NY, USA
K.A. Alsabti , Sch. of Comput. & Inf. Sci., Syracuse Univ., NY, USA
S. Ranka , Sch. of Comput. & Inf. Sci., Syracuse Univ., NY, USA
pp. 20

A data parallel algorithm for Boolean function manipulation (Abstract)

M.S. Reorda , Dipartimento di Autom. e Inf., Politecnico di Torino, Italy
S. Gai , Dipartimento di Autom. e Inf., Politecnico di Torino, Italy
M. Rebaudengo , Dipartimento di Autom. e Inf., Politecnico di Torino, Italy
pp. 28
Session 2B: Minisymposium: HPF Vendor Perspectives, Chair: Joel Saltz, University of Maryland

Migrating CM Fortran applications to HPF (Abstract)

D. Miles , Portland Group Inc., USA
L. Meadows , Portland Group Inc., USA
pp. 37

The DEC High Performance Fortran 90 compiler front end (Abstract)

D.B. Loveman , Digital Equipment Corp., USA
pp. 46

The DOSHARED directive in CRAFT on the Cray T3D (Abstract)

T. MacDonald , Cray Res. Inc., Eagan, MN, USA
G. Elsesser , Cray Res. Inc., Eagan, MN, USA
A. Meltzer , Cray Res. Inc., Eagan, MN, USA
pp. 54
Session 3A: Caches

The performance impact of false subpage sharing in KSR1 (Abstract)

B. Cukic , Dept. of Comput. Sci., Houston Univ., TX, USA
F.B. Bastani , Dept. of Comput. Sci., Houston Univ., TX, USA
pp. 64

A multi-cache coherence scheme for shuffle-exchange network based multiprocessors (Abstract)

R.A. Omran , Dept. of Comput. Sci., York Univ., UK
De-Lei Lee , Dept. of Comput. Sci., York Univ., UK
pp. 72

MICA: a mapped interconnection-cached architecture (Abstract)

Yuh-Dauh Lyuu , NEC Res. Inst., Princeton, NJ, USA
E. Schenfeld , NEC Res. Inst., Princeton, NJ, USA
pp. 80

Performance analysis and optimal system configuration of hierarchical two-level COMA multiprocessors (Abstract)

F.N. Sibai , Dept. of Electr. Eng., Akron Univ., OH, USA
Ting-Li Hu , Dept. of Electr. Eng., Akron Univ., OH, USA
pp. 90
Session 3B: Panel

Scalable I/O (PDF)

Session 4A: I/O Related

Compilation of I/O communications for HPF (Abstract)

F. Coelho , Centre de Recherche en Inf., Ecole des Mines de Paris, Valbonne, France
pp. 102

Compiler support for out-of-core arrays on parallel machines (Abstract)

C. Koelbel , Dept. of Comput. Sci., Rice Univ., Houston, TX, USA
K. Kennedy , Dept. of Comput. Sci., Rice Univ., Houston, TX, USA
M. Paleczny , Dept. of Comput. Sci., Rice Univ., Houston, TX, USA
pp. 110

A data management approach for handling large compressed arrays in high performance computing (Abstract)

M. Winslett , Dept. of Comput. Sci., Illinois Univ., Urbana, IL, USA
K.E. Seamons , Dept. of Comput. Sci., Illinois Univ., Urbana, IL, USA
pp. 119

Parallel I/O from the user's perspective (Abstract)

J. Gotwals , Dept. of Comput. Sci., Indiana Univ., Bloomington, IN, USA
S. Yang , Dept. of Comput. Sci., Indiana Univ., Bloomington, IN, USA
S. Srinivas , Dept. of Comput. Sci., Indiana Univ., Bloomington, IN, USA
pp. 129
Session 4B: Computational Science I

A high performance sparse Cholesky factorization algorithm for scalable parallel computers (Abstract)

G. Karypis , Dept. of Comput. Sci., Minnesota Univ., Minneapolis, MN, USA
V. Kumar , Dept. of Comput. Sci., Minnesota Univ., Minneapolis, MN, USA
pp. 140

Parallelization and performance of three-dimensional plasma simulation (Abstract)

T.A. Grotjohn , Dept. of Electr. Eng., Michigan State Univ., East Lansing, MI, USA
Yung-Kang Chu , Dept. of Electr. Eng., Michigan State Univ., East Lansing, MI, USA
V.P. Gopinath , Dept. of Electr. Eng., Michigan State Univ., East Lansing, MI, USA
D.T. Rover , Dept. of Electr. Eng., Michigan State Univ., East Lansing, MI, USA
pp. 148

Parallel molecular dynamics: communication requirements for massively parallel machines (Abstract)

K.E. Arnold , Dept. of Electr. Eng. & Comput. Sci., Northwestern Univ., Evanston, IL, USA
R.L. Stevens , Dept. of Electr. Eng. & Comput. Sci., Northwestern Univ., Evanston, IL, USA
V.E. Taylor , Dept. of Electr. Eng. & Comput. Sci., Northwestern Univ., Evanston, IL, USA
pp. 156

Dataparallel semi-Lagrangian numerical weather forecasting (Abstract)

L. Wolters , Dept. of Comput. Sci., Leiden Univ., Netherlands
N. Gustafsson , Dept. of Comput. Sci., Leiden Univ., Netherlands
G. Cats , Dept. of Comput. Sci., Leiden Univ., Netherlands
T. Wilhelmsson , Dept. of Comput. Sci., Leiden Univ., Netherlands
pp. 164

On mapping data and computation for parallel sparse Cholesky factorization (Abstract)

K. Eswar , Dept. of Comput. & Inf. Sci., Ohio State Univ., Columbus, OH, USA
P. Sadayappan , Dept. of Comput. & Inf. Sci., Ohio State Univ., Columbus, OH, USA
Chua-Huang Huang , Dept. of Comput. & Inf. Sci., Ohio State Univ., Columbus, OH, USA
pp. 171
Session 4C: Panel
Session 5

Invited Speaker (PDF)

Session 6A: Data Parallel

Work-efficient nested data-parallelism (Abstract)

D.W. Palmer , Dept. of Comput. Sci., North Carolina Univ., Chapel Hill, NC, USA
J.F. Prins , Dept. of Comput. Sci., North Carolina Univ., Chapel Hill, NC, USA
S. Westfold , Dept. of Comput. Sci., North Carolina Univ., Chapel Hill, NC, USA
pp. 186

A data parallel C and its platforms (Abstract)

J.D. Schlesinger , David Sarnoff Res. Center, Princeton, NJ, USA
M. Gokhale , David Sarnoff Res. Center, Princeton, NJ, USA
pp. 194

An object-oriented approach to nested data parallelism (Abstract)

T.J. Sheffler , Res. Inst. for Adv. Comput. Sci., NASA Ames Res. Center, Moffett Field, CA, USA
S. Chatterjee , Res. Inst. for Adv. Comput. Sci., NASA Ames Res. Center, Moffett Field, CA, USA
pp. 203

Braid: integrating task and data parallelism (Abstract)

E.A. West , Dept. of Comput. Sci., Virginia Univ., Charlottesville, VA, USA
A.S. Grimshaw , Dept. of Comput. Sci., Virginia Univ., Charlottesville, VA, USA
pp. 211
Session 6B: Computational Science II

Optimizing irregular computations on SIMD machines: a case study (Abstract)

M. Lynch , Dept. of Comput. & Inf. Sci., Oregon Univ., Eugene, OR, USA
J.S. Conery , Dept. of Comput. & Inf. Sci., Oregon Univ., Eugene, OR, USA
T. Hovland , Dept. of Comput. & Inf. Sci., Oregon Univ., Eugene, OR, USA
pp. 222

Parallel homologous sequence searching in large databases (Abstract)

R.L. Martino , Div. of Comput. Res. & Technol., Nat. Inst. of Health, Bethesda, MD, USA
T.K. Yap , Div. of Comput. Res. & Technol., Nat. Inst. of Health, Bethesda, MD, USA
O. Frieder , Div. of Comput. Res. & Technol., Nat. Inst. of Health, Bethesda, MD, USA
pp. 231

An optimal parallel algorithm for volume ray casting (Abstract)

V. Goel , Dept. of Comput. Sci., Central Florida Univ., Orlando, FL, USA
A. Mukherjee , Dept. of Comput. Sci., Central Florida Univ., Orlando, FL, USA
pp. 238

The performance impact of data placement for wavelet decomposition of two-dimensional image data on SIMD machines (Abstract)

J. LeMoigne , Dept. of Electr. Eng., Texas A&M Univ., College Station, TX, USA
A.K. Chan , Dept. of Electr. Eng., Texas A&M Univ., College Station, TX, USA
H.J. Lee , Dept. of Electr. Eng., Texas A&M Univ., College Station, TX, USA
C. Chui , Dept. of Electr. Eng., Texas A&M Univ., College Station, TX, USA
J.C. Liu , Dept. of Electr. Eng., Texas A&M Univ., College Station, TX, USA
T.A. El-Ghazawi , Dept. of Electr. Eng., Texas A&M Univ., College Station, TX, USA
pp. 246
Session 7

Invited Speaker (PDF)

Session 8A: Panel

Parallel C++ (PDF)

Session 8B: Architectural Structures

The practicality of SIMD for scientific computing (Abstract)

R.K. de Fainchtein , High Performance Comput. Branch, NASA Goddard Space Flight Center, Greenbelt, MD, USA
T.L. Sterling , High Performance Comput. Branch, NASA Goddard Space Flight Center, Greenbelt, MD, USA
C.M. Mobarry , High Performance Comput. Branch, NASA Goddard Space Flight Center, Greenbelt, MD, USA
J.S. Cohen , High Performance Comput. Branch, NASA Goddard Space Flight Center, Greenbelt, MD, USA
T. Olson , High Performance Comput. Branch, NASA Goddard Space Flight Center, Greenbelt, MD, USA
L.E. Harnet , High Performance Comput. Branch, NASA Goddard Space Flight Center, Greenbelt, MD, USA
J.A. Pedelty , High Performance Comput. Branch, NASA Goddard Space Flight Center, Greenbelt, MD, USA
J.R. Fischer , High Performance Comput. Branch, NASA Goddard Space Flight Center, Greenbelt, MD, USA
P.J. MacNeice , High Performance Comput. Branch, NASA Goddard Space Flight Center, Greenbelt, MD, USA
B.A. Fryxell , High Performance Comput. Branch, NASA Goddard Space Flight Center, Greenbelt, MD, USA
pp. 258

Characteristics of the MasPar parallel I/O system (Abstract)

T.A. El-Ghazawi , Dept. of Electr. Eng. & Comput. Sci., George Washington Univ., Washington, DC, USA
pp. 265

Efficient matrix operations in a reconfigurable array with spanning optical buses (Abstract)

Chunming Qiao , Dept. of Electr. & Comput. Eng., State Univ. of New York, Buffalo, NY, USA
pp. 273

Introducing MGAP-2 [Micro-Grain Array Processor] (Abstract)

M.J. Irwin , Dept. of Comput. Sci., Pennsylvania State Univ., University Park, PA, USA
T.P. Kelliher , Dept. of Comput. Sci., Pennsylvania State Univ., University Park, PA, USA
Fan Zhou , Dept. of Comput. Sci., Pennsylvania State Univ., University Park, PA, USA
R.M. Owens , Dept. of Comput. Sci., Pennsylvania State Univ., University Park, PA, USA
pp. 281
Session 9A: Networks

Analysis of cost of performing communications using various communication mechanisms (Abstract)

V.T. Prasanna , Dept. of Electr. Eng. Syst., Univ. of Southern California, Los Angeles, CA, USA
Cho-Chin Lin , Dept. of Electr. Eng. Syst., Univ. of Southern California, Los Angeles, CA, USA
pp. 290

Spectrum analysis and min-cut transformation of communication networks in parallel computers (Abstract)

Z.G. Mou , Dept. of Comput. Sci., Brandeis Univ., Waltham, MA, USA
pp. 298

Periodically regular chordal ring networks for massively parallel architectures (Abstract)

B. Parhami , Dept. of Electr. & Comput. Eng., California Univ., Santa Barbara, CA, USA
pp. 315
Session 9B: Compilers

Aligning parallel arrays to reduce communication (Abstract)

J.R. Gilbert , Res. Inst. for Adv. Comput. Sci., NASA Ames Res. Center, Moffett Field, CA, USA
S. Chatterjee , Res. Inst. for Adv. Comput. Sci., NASA Ames Res. Center, Moffett Field, CA, USA
R. Schreiber , Res. Inst. for Adv. Comput. Sci., NASA Ames Res. Center, Moffett Field, CA, USA
T.J. Sheffler , Res. Inst. for Adv. Comput. Sci., NASA Ames Res. Center, Moffett Field, CA, USA
pp. 324

Code generation for multiple mappings (Abstract)

W. Pugh , Dept. of Comput. Sci., Maryland Univ., College Park, MD, USA
E. Rosser , Dept. of Comput. Sci., Maryland Univ., College Park, MD, USA
W. Kelly , Dept. of Comput. Sci., Maryland Univ., College Park, MD, USA
pp. 332

Automatic generation of efficient array redistribution routines for distributed memory multicomputers (Abstract)

S. Ramaswamy , Center for Reliable & High Performance Comput., Illinois Univ., Urbana, IL, USA
P. Banerjee , Center for Reliable & High Performance Comput., Illinois Univ., Urbana, IL, USA
pp. 342

Automatic synchronisation elimination in synchronous FORALLs (Abstract)

E.A. Heinz , IPD, Karlsruhe Univ., Germany
M. Philippsen , IPD, Karlsruhe Univ., Germany
pp. 350
Session 9C: Partitioning and Mapping

A parallel graph partitioner on a distributed memory multiprocessor (Abstract)

A. Sangiovanni-Vincentelli , California Univ., Berkeley, CA, USA
P. Buch , California Univ., Berkeley, CA, USA
J. Sanghavi , California Univ., Berkeley, CA, USA
pp. 360

Parallel remapping algorithms for adaptive problems (Abstract)

Chao-Wei Ou , Sch. of Comput. & Inf. Sci., Syracuse Univ., NY, USA
S. Ranka , Sch. of Comput. & Inf. Sci., Syracuse Univ., NY, USA
pp. 367

On the influence of partitioning schemes on the efficiency of overlapping domain decomposition methods (Abstract)

P. Ciarlet, Jr. , CEA, Villeneuve-St.-Georges, France
B.F. Smith , CEA, Villeneuve-St.-Georges, France
F. Lamour , CEA, Villeneuve-St.-Georges, France
pp. 375

Exploitation of control parallelism in data parallel algorithms (Abstract)

D.E. Schimmel , Sch. of Electr. and Comput. Eng., Georgia Inst. of Technol., Atlanta, GA, USA
V. Garg , Sch. of Electr. and Comput. Eng., Georgia Inst. of Technol., Atlanta, GA, USA
pp. 385
Banquet Speaker

Banquet Speaker (PDF)

Session 10

Invited Speaker (PDF)

Session 11A: Tools I

PERFSIM: a tool for automatic performance analysis of data-parallel Fortran programs (Abstract)

S. Toledo , Lab. for Comput. Sci., MIT, Cambridge, MA, USA
pp. 396

Performance debugging based on scalability analysis (Abstract)

T. Gross , Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
T. Suzuoka , Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
J. Subhlok , Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
pp. 406

ProcSimity: an experimental tool for processor allocation and scheduling in highly parallel systems (Abstract)

V. Lo , Dept. of Comput. Sci., Oregon Univ., Eugene, OR, USA
J.V. Miller , Dept. of Comput. Sci., Oregon Univ., Eugene, OR, USA
K. Windisch , Dept. of Comput. Sci., Oregon Univ., Eugene, OR, USA
pp. 414

Falcon: on-line monitoring and steering of large-scale parallel programs (Abstract)

N. Mallavarupu , Coll. of Comput., Georgia Inst. of Technol., Atlanta, GA, USA
J. Stasko , Coll. of Comput., Georgia Inst. of Technol., Atlanta, GA, USA
G. Eisenhauer , Coll. of Comput., Georgia Inst. of Technol., Atlanta, GA, USA
Weiming Gu , Coll. of Comput., Georgia Inst. of Technol., Atlanta, GA, USA
J. Vetter , Coll. of Comput., Georgia Inst. of Technol., Atlanta, GA, USA
K. Schwan , Coll. of Comput., Georgia Inst. of Technol., Atlanta, GA, USA
E. Kraemer , Coll. of Comput., Georgia Inst. of Technol., Atlanta, GA, USA
pp. 422
Session 11B: Runtime Systems

Runtime support for data parallel tasks (Abstract)

M. Haines , Inst. for Comput. Applications in Sci. & Eng., NASA Langley Res. Center, Hampton, VA, USA
P. Mehrotra , Inst. for Comput. Applications in Sci. & Eng., NASA Langley Res. Center, Hampton, VA, USA
B. Hess , Inst. for Comput. Applications in Sci. & Eng., NASA Langley Res. Center, Hampton, VA, USA
H. Zima , Inst. for Comput. Applications in Sci. & Eng., NASA Langley Res. Center, Hampton, VA, USA
J. Van Rosendale , Inst. for Comput. Applications in Sci. & Eng., NASA Langley Res. Center, Hampton, VA, USA
pp. 432

Runtime support for execution of fine grain parallel code on coarse grain multiprocessors (Abstract)

R.B. Schnabel , Dept. of Comput. Sci., Colorado Univ., Boulder, CO, USA
R. Neves , Dept. of Comput. Sci., Colorado Univ., Boulder, CO, USA
pp. 440

Runtime support for user-level ultra lightweight threads on massively parallel distributed memory machines (Abstract)

Wei Shu , Dept. of Comput. Sci., State Univ. of New York, Buffalo, NY, USA
pp. 448

Runtime incremental parallel scheduling (RIPS) for large-scale parallel computers (Abstract)

Wei Shu , Dept. of Comput. Sci., State Univ. of New York, Buffalo, NY, USA
Min-You Wu , Dept. of Comput. Sci., State Univ. of New York, Buffalo, NY, USA
pp. 456

(Abstract)

pp. 466
Session 12A: Tools II

A scalable, visual interface for debugging with event-based behavioral abstraction (Abstract)

J. Kundu , Dept. of Comput. Sci., Massachusetts Univ., Amherst, MA, USA
J.E. Cuny , Dept. of Comput. Sci., Massachusetts Univ., Amherst, MA, USA
pp. 472

Visualizing distributed data structures (Abstract)

S. Srinivas , Dept. of Comput. Sci., Indiana Univ., Bloomington, IN, USA
pp. 480

Migrating from PVM to MPI. I. The Unify system (Abstract)

P.L. Vaughan , NSF Eng. Res. Center for Computational Field Simulation, Mississippi State Univ., MS, USA
A. Skjellum , NSF Eng. Res. Center for Computational Field Simulation, Mississippi State Univ., MS, USA
D.S. Reese , NSF Eng. Res. Center for Computational Field Simulation, Mississippi State Univ., MS, USA
Fei-Chen Cheng , NSF Eng. Res. Center for Computational Field Simulation, Mississippi State Univ., MS, USA
pp. 488

Implementing multidisciplinary and multi-zonal applications using MPI (Abstract)

S.A. Fineberg , Comput. Sci. Corp., NASA Ames Res. Center, Moffett Field, CA, USA
pp. 496
Session 12B: Algorithms II

Time- and VLSI-optimal convex hull computation on meshes with multiple broadcasting (Abstract)

V. Bokka , Dept. of Comput. Sci., Old Dominion Univ., Norfolk, VA, USA
H. Gurla , Dept. of Comput. Sci., Old Dominion Univ., Norfolk, VA, USA
S. Olariu , Dept. of Comput. Sci., Old Dominion Univ., Norfolk, VA, USA
J.L. Schwing , Dept. of Comput. Sci., Old Dominion Univ., Norfolk, VA, USA
pp. 506

Algorithm for constructing fault-tolerant solutions of the circulant graph configuration (Abstract)

A.A. Farrag , Dept. of Math. & Comput. Sci., Dalhousie Univ., Halifax, NS, Canada
pp. 514

Design and analysis of product networks (Abstract)

A. Youssef , Dept. of Electr. Eng. & Comput. Sci., George Washington Univ., Washington, DC, USA
pp. 521

A broadcast algorithm for all-port wormhole-routed torus networks (Abstract)

P.K. McKinley , Dept. of Comput. Sci., Michigan State Univ., East Lansing, MI, USA
Yih-Jia Tsai , Dept. of Comput. Sci., Michigan State Univ., East Lansing, MI, USA
pp. 529
Session 12C: Panel

Embedded Systems (PDF)


Author Index (PDF)

pp. 538