The Community for Technology Leaders
SC Conference (1990)
New York, NY, USA
Nov. 12, 1990 to Nov. 16, 1990
ISBN: 0-8186-2056-0
TABLE OF CONTENTS
Front Matter

Front Matter (PDF)

pp. i-xxv
Papers

LAPACK: A portable linear algebra library for high-performance computers (Abstract)

Bai , Tennessee Univ., Knoxville, TN, USA
Dongarra , Tennessee Univ., Knoxville, TN, USA
Angerson , Tennessee Univ., Knoxville, TN, USA
pp. 2-11

Hierarchical blocking and data flow analysis for numerical linear algebra (Abstract)

Chen , Center for Appl. Math., Cornell Univ., Ithaca, NY, USA
pp. 12-19

Multilinear algebra and parallel programming (Abstract)

Johnson , Dept. of Comput. Sci., City Univ. of New York, NY, USA
pp. 20-31

The impact of memory organization on the performance of matrix multiplication (Abstract)

Hake , Forschungszentrum Juelich GmbH, Germany
Homberg , Forschungszentrum Juelich GmbH, Germany
pp. 34-40

On randomly interleaved memories (Abstract)

Raghavan , Dept. of Electr. Eng. & Comput. Sci., Michigan Univ., Ann Arbor, MI, USA
Hayes , Dept. of Electr. Eng. & Comput. Sci., Michigan Univ., Ann Arbor, MI, USA
pp. 49-58

Tracing application program execution on the Cray X-MP and Cray 2 (Abstract)

Malony , Center for Supercomput. Res. & Dev., Illinois Univ., Urbana, IL, USA
Larson , Center for Supercomput. Res. & Dev., Illinois Univ., Urbana, IL, USA
Reed , Center for Supercomput. Res. & Dev., Illinois Univ., Urbana, IL, USA
pp. 60-73

Parallel program debugging with on-the-fly anomaly detection (Abstract)

Mellor-Crummey , Dept. of Comput. Sci., Rice Univ., Houston, TX, USA
Hood , Dept. of Comput. Sci., Rice Univ., Houston, TX, USA
Kennedy , Dept. of Comput. Sci., Rice Univ., Houston, TX, USA
pp. 74-81

Improving instruction cache behavior by reducing cache pollution (Abstract)

Chi , North American Philips Corp., Briarcliff Manor, NY, USA
Gupta , North American Philips Corp., Briarcliff Manor, NY, USA
pp. 82-91

A parallel Monte Carlo search algorithm for the conformational analysis of proteins (Abstract)

Ripoll , Biotechnol. Res. Inst., Nat. Res. Council of Canada, Montreal, Que., Canada
Thomas , Biotechnol. Res. Inst., Nat. Res. Council of Canada, Montreal, Que., Canada
pp. 94-102

Folding RNA on the Cray-2 (Abstract)

Ess , Cray Comput. Corp., Colorado Springs, CO, USA
pp. 103-111

Experience with a performance analyzer for multithreaded applications (Abstract)

Brooks , CONVEX Comput. Corp., Richardson, TX, USA
Linthicum , CONVEX Comput. Corp., Richardson, TX, USA
Hansen , CONVEX Comput. Corp., Richardson, TX, USA
pp. 124-131

Supercomputer network selection: A case study (Abstract)

French , Eli Lilly & Co., Indianapolis, IN, USA
pp. 154-159

Very high performance networking for supercomputing (Abstract)

Clinger , Solbourne Comput., Longmont, CO, USA
pp. 160-168

Cost-performance analysis of heterogeneity in supercomputer architectures (Abstract)

Menasce , Dept. de Inf., Pontifica Univ. Catolica, Rio de Janeiro, Brazil
pp. 169-177

Fast barrier synchronization hardware (Abstract)

Beckmann , Center for Supercomput. Res. & Dev., Illinois Univ., Urbana, IL, USA
Polychronopoulos , Center for Supercomput. Res. & Dev., Illinois Univ., Urbana, IL, USA
pp. 180-189

Switch-stacks: A scheme for microtasking nested parallel loops (Abstract)

Harrison , Center for Supercomput. Res. & Dev., Illinois Univ., Urbana, IL, USA
Chow , Center for Supercomput. Res. & Dev., Illinois Univ., Urbana, IL, USA
pp. 190-199

Parallelization of loops with exits on pipelined architectures (Abstract)

Schlansker , Hewlett-Packard Lab., Palo Alto, CA, USA
Lee , Hewlett-Packard Lab., Palo Alto, CA, USA
Tirumalai , Hewlett-Packard Lab., Palo Alto, CA, USA
pp. 200-212

Computation of large scale constrained matrix problems: the splitting equilibration algorithm (Abstract)

Nagurney , Massachusetts Univ., Amherst, MA, USA
Eydeland , Massachusetts Univ., Amherst, MA, USA
Kim , Massachusetts Univ., Amherst, MA, USA
pp. 214-223

High performance preconditioning on supercomputers for the 3D device simulator MINIMOS (Abstract)

Mader , SIEMENS AG Osterreich, Vienna, Austria
Traar , SIEMENS AG Osterreich, Vienna, Austria
pp. 224-231

Fault-tolerant routing in MIN-based supercomputers (Abstract)

Chalasani , Dept. of Electr. Eng-Syst., Univ. of Southern California, Los Angeles, CA, USA
Raghavendra , Dept. of Electr. Eng-Syst., Univ. of Southern California, Los Angeles, CA, USA
pp. 244-253

Uni-directional hypercubes (Abstract)

Du , Dept. of Comput. Sci., Minnesota Univ., Minneapolis, MN, USA
Chou , Dept. of Comput. Sci., Minnesota Univ., Minneapolis, MN, USA
pp. 254-263

Design and analysis of buffered crossbars and banyans with cut-through switching (Abstract)

Fang , Dept. of Electr. Eng., Columbia Univ., New York, NY, USA
Szymanski , Dept. of Electr. Eng., Columbia Univ., New York, NY, USA
pp. 264-273

A parallel object-oriented total architecture: A-NET (Abstract)

Hamada , Dept. of Inf. Sci., Utsunomiya Univ., Japan
Yoshinaga , Dept. of Inf. Sci., Utsunomiya Univ., Japan
Suzuki , Dept. of Inf. Sci., Utsunomiya Univ., Japan
Baba , Dept. of Inf. Sci., Utsunomiya Univ., Japan
Iijima , Dept. of Inf. Sci., Utsunomiya Univ., Japan
Iwamoto , Dept. of Inf. Sci., Utsunomiya Univ., Japan
pp. 276-285

A parallel computer model supporting procedure-based communication (Abstract)

Zhou , Allied-Signal Aerosp. Technol. Center, Columbia, MD, USA
pp. 286-294

Performance estimation in a massively parallel system (Abstract)

Agrawal , AT&T Bell Lab., Murray Hill, NJ, USA
pp. 306-313

Parameterized algorithm decomposition and performance analysis (Abstract)

Harkin , Dept. of Comput. Sci., Montana State Univ., Bozeman, MT, USA
pp. 314-323

Another view on parallel speedup (Abstract)

Sun , Dept. of Comput. Sci., Michigan State Univ., East Lansing, MI, USA
Ni , Dept. of Comput. Sci., Michigan State Univ., East Lansing, MI, USA
pp. 324-333

A taxonomy of concepts for evaluating chess strength (Abstract)

Berliner , Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
pp. 336-343

Chess and supercomputers: details about optimizing Cray Blitz (Abstract)

Hyatt , Alabama Univ., Birmingham, AL, USA
pp. 354-363

Experiences in building the Clemson Computational Sciences Program (Abstract)

Stevenson , Clemson Univ., SC, USA
Panoff , Clemson Univ., SC, USA
pp. 366-375

A real introduction to supercomputing: a user training course (Abstract)

Hanson , Dept. of Math., Stat. & Comput. Sci., Illinois Univ., Chicago, IL, USA
pp. 376-385

Loop displacement: an approach for transforming and scheduling loops for parallel execution (Abstract)

Gupta , Dept. of Comput. Sci., Pittsburgh Univ., PA, USA
pp. 388-397

A compiler-assisted approach to SPMD execution (Abstract)

Cytron , IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
pp. 398-406

Loop distribution with arbitrary control flow (Abstract)

McKinley , Dept. of Comput. Sci., Rice Univ., Houston, TX, USA
Kennedy , Dept. of Comput. Sci., Rice Univ., Houston, TX, USA
pp. 407-416

Large scale computing on clustered vector multiprocessors (Abstract)

Kamel , IBM Sci. Center, Bergen, Norway
pp. 418-427

Perfect Benchmarks decomposition and performance on VAX multiprocessors (Abstract)

Nofsinger , Digital Equipment Corp., Maynard, MA, USA
Freedman , Digital Equipment Corp., Maynard, MA, USA
Cvetanovic , Digital Equipment Corp., Maynard, MA, USA
pp. 455-464

Efficient decomposition and performance of parallel PDE, FFT, Monte Carlo simulations, simplex, and sparse solvers (Abstract)

Nofsinger , Digital Equipment Corp., Maynard, MA, USA
Cvetanovic , Digital Equipment Corp., Maynard, MA, USA
Freedman , Digital Equipment Corp., Maynard, MA, USA
pp. 465-474

Embedding meshes on the star graph (Abstract)

Ranka , Sch. of Comput. & Inf. Sci., Syracuse Univ., NY, USA
Wang , Sch. of Comput. & Inf. Sci., Syracuse Univ., NY, USA
Yeh , Sch. of Comput. & Inf. Sci., Syracuse Univ., NY, USA
pp. 476-485

Logarithmic time cost optimal parallel sorting is not yet fast in practice (Abstract)

Natvig , Norwegian Inst. of Technol., Trondheim Univ., Norway
pp. 486-494

A simple and correct shared-queue algorithm using compare-and-swap (Abstract)

Stone , IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
pp. 495-504

Fine-grain parallelism in the ALPS programming language (Abstract)

Vishnubhotla , Dept. of Comput. & Inf. Sci., Ohio State Univ., Columbus, OH, USA
pp. 506-514

Delirium: an embedding coordination language (Abstract)

Sharp , Dept. of Comput. Sci., California Univ., Berkeley, CA, USA
Lucco , Dept. of Comput. Sci., California Univ., Berkeley, CA, USA
pp. 515-524

UC: a language for the connection machine (Abstract)

Bagrodia , Dept. of Comput. Sci., California Univ., Los Angeles, CA, USA
pp. 525-534

Parallel algorithm research at CERFACS (Abstract)

Duff , Rutherford Appleton Lab., Chilton, UK
pp. 536-542

A write update cache coherence protocol for MIN-based multiprocessors with accessibility-based split caches (Abstract)

Algudady , Dept. of Electr. & Comput. Eng., Pennsylvania State Univ., University Park, PA, USA
Thazhuthaveetil , Dept. of Electr. & Comput. Eng., Pennsylvania State Univ., University Park, PA, USA
Das , Dept. of Electr. & Comput. Eng., Pennsylvania State Univ., University Park, PA, USA
pp. 544-553

Cache coherence in systems with parallel communication channels and many processors (Abstract)

Hill , Philips Lab., Briarcliff Manor, NY, USA
Sanderson , Philips Lab., Briarcliff Manor, NY, USA
Willis , Philips Lab., Briarcliff Manor, NY, USA
pp. 554-563

Data cache performance of supercomputer applications (Abstract)

Callahan , Tera Comput. Co., Seattle, WA, USA
Porterfield , Tera Comput. Co., Seattle, WA, USA
pp. 564-572

Resource binding-a universal approach to parallel programming (Abstract)

Shing , Dept. of Comput. Sci., Michigan State Univ., East Lansing, MI, USA
Ni , Dept. of Comput. Sci., Michigan State Univ., East Lansing, MI, USA
pp. 574-583

A flexible communication abstraction for nonshared memory parallel computing (Abstract)

Alverson , Dept. of Comput. Sci. & Eng., Washington Univ., Seattle, WA, USA
Notkin , Dept. of Comput. Sci. & Eng., Washington Univ., Seattle, WA, USA
Griswold , Dept. of Comput. Sci. & Eng., Washington Univ., Seattle, WA, USA
Snyder , Dept. of Comput. Sci. & Eng., Washington Univ., Seattle, WA, USA
pp. 584-593

Implementation machine paradigm for parallel programming (Abstract)

Vrsalovic , Carnegie Mellon Univ., Pittsburgh, PA, USA
Segall , Carnegie Mellon Univ., Pittsburgh, PA, USA
Rao , Carnegie Mellon Univ., Pittsburgh, PA, USA
pp. 594-603

Efficient parallel logic simulation techniques for the Connection Machine (Abstract)

Chung , Dept. of Comput. Sci., Michigan State Univ., East Lansing, MI, USA
Chung , Dept. of Comput. Sci., Michigan State Univ., East Lansing, MI, USA
pp. 606-614

Design of a scalable parallel switch-level simulator for VLSI (Abstract)

Saab , Coordinated Sci. Lab., Illinois Univ., Urbana, IL, USA
Mueller-Thuns , Coordinated Sci. Lab., Illinois Univ., Urbana, IL, USA
pp. 615-624

SISAL versus Fortran: a comparison using the Livermore Loops (Abstract)

Feo , Lawrence Livermore Nat. Lab., CA, USA
Cann , Lawrence Livermore Nat. Lab., CA, USA
pp. 626-636

Experimental analysis of communication/data-conditional aspects of a mixed-mode parallel architecture via synthetic computations (Abstract)

Casavant , Dept. of Electr. & Comput. Eng., Iowa Univ., Iowa City, IA, USA
Fineberg , Dept. of Electr. & Comput. Eng., Iowa Univ., Iowa City, IA, USA
pp. 637-646

Performance evaluation of mesh-connected wormhole-routed networks for interprocessor communication in multicomputers (Abstract)

Chittor , Dept. of Comput. Sci., Michigan State Univ., East Lansing, MI, USA
Enbody , Dept. of Comput. Sci., Michigan State Univ., East Lansing, MI, USA
pp. 647-656

Theorem proving in propositional logic on vector computers using a generalized Davis-Putman procedure (Abstract)

Ming-Yi Fang , Inst. of Comput. Sci., Nat. Tsing Hua Univ., Hsinchu, Taiwan
Wen-Tsuen Chen , Inst. of Comput. Sci., Nat. Tsing Hua Univ., Hsinchu, Taiwan
pp. 658-665

Scan primitives for vector computers (Abstract)

Chatterjee , Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
Blelloch , Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
Zagha , Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
pp. 666-675

A vectorized long-period shift-register random number generator (Abstract)

Vitaletti , IBM Eur. Center for Sci. & Eng. Comput., Rome, Italy
Filippone , IBM Eur. Center for Sci. & Eng. Comput., Rome, Italy
Santangelo , IBM Eur. Center for Sci. & Eng. Comput., Rome, Italy
pp. 676-684

Massively parallel computational methods in light scattering by small particles (Abstract)

Potter , Dept. of Electr. & Comput. Eng., Clarkson Univ., Potsdam, NY, USA
Cline , Dept. of Electr. & Comput. Eng., Clarkson Univ., Potsdam, NY, USA
pp. 686-692

MONT3E: A Monte Carlo electron heat transfer code (Abstract)

Maltby , Lawrence Livermore Nat. Lab., CA, USA
Kornblum , Lawrence Livermore Nat. Lab., CA, USA
pp. 700-707

A fiber optic hypermesh for SIMD/MIMD machines (Abstract)

Szymanski , Dept. of Electr. Eng., Columbia Univ., New York, NY, USA
pp. 710-719

A message passing coprocessor for distributed memory multicomputers (Abstract)

Hsu , Coordinated Sci. Lab., Illinois Univ., Urbana, IL, USA
Banerjee , Coordinated Sci. Lab., Illinois Univ., Urbana, IL, USA
pp. 720-729

Architectural support for register allocation in the presence of aliasing (Abstract)

Heggy , Dept. of Comput. Sci., Pittsburgh Univ., PA, USA
Soffa , Dept. of Comput. Sci., Pittsburgh Univ., PA, USA
pp. 730-739

An optimal hypercube direct N-body solver on the Connection Machine (Abstract)

Mesirov , Thinking Machines Corp., Cambridge, MA, USA
Brunet , Thinking Machines Corp., Cambridge, MA, USA
pp. 748-752

P3D: A Lisp-based format for representing general 3D models (Abstract)

Welling , Pittsburgh Supercomput. Center, PA, USA
pp. 766-774

Scientific data visualization: a formal introduction to the rendering and geometric modeling aspects (Abstract)

Choudry , Alabama Univ., Huntsville, AL, USA
Harrand , Alabama Univ., Huntsville, AL, USA
Ziebarth , Alabama Univ., Huntsville, AL, USA
pp. 775-783

Run-time monitoring of concurrent programs on the Cedar multiprocessor (Abstract)

Berry , Center for Supercomput. Res. & Dev., Illinois Univ., Urbana, IL, USA
Sharma , Center for Supercomput. Res. & Dev., Illinois Univ., Urbana, IL, USA
Malony , Center for Supercomput. Res. & Dev., Illinois Univ., Urbana, IL, USA
Sinvhal-Sharma , Center for Supercomput. Res. & Dev., Illinois Univ., Urbana, IL, USA
pp. 784-793

Future general purpose supercomputer architectures (Abstract)

Hsiung , Cray Res. Inc., Chippewa Fallls, WI, USA
Smith , Cray Res. Inc., Chippewa Fallls, WI, USA
Hsu , Cray Res. Inc., Chippewa Fallls, WI, USA
pp. 796-804

Building black holes, gravitational waves, and relativistic fluid flows: supercomputer cinema (Abstract)

Teukolsky , Cornell Univ., Ithaca, NY, USA
Shapiro , Cornell Univ., Ithaca, NY, USA
pp. 805-814

Quantum molecular modeling with simulated annealing-A distributed processing and visualization application (Abstract)

Hohl , Nat. Center for Supercomput. Applications, Illinois Univ., Urbana, IL, USA
Idaszak , Nat. Center for Supercomput. Applications, Illinois Univ., Urbana, IL, USA
pp. 816-825

Time dilation visualization in relativity (Abstract)

Dunn , Carnegie Mellon Univ., Pittsburgh, PA, USA
Hsiung , Carnegie Mellon Univ., Pittsburgh, PA, USA
Cox , Carnegie Mellon Univ., Pittsburgh, PA, USA
Thibadeau , Carnegie Mellon Univ., Pittsburgh, PA, USA
pp. 835-844

Partitioning declarative programs into communicating processes (Abstract)

Bic , Dept. of Inf. & Comput. Sci., California Univ., Irvine, CA, USA
Nagel , Dept. of Inf. & Comput. Sci., California Univ., Irvine, CA, USA
Roy , Dept. of Inf. & Comput. Sci., California Univ., Irvine, CA, USA
pp. 846-855

Parallel processing of near fine grain tasks using static scheduling on OSCAR (optimally scheduled advanced multiprocessor) (Abstract)

Kasahara , Dept. of Electr. Eng., Waseda Univ., Tokyo, Japan
Narita , Dept. of Electr. Eng., Waseda Univ., Tokyo, Japan
Honda , Dept. of Electr. Eng., Waseda Univ., Tokyo, Japan
pp. 856-864

Generating explicit communication from shared-memory program references (Abstract)

Li , Dept. of Comput. Sci., Yale Univ., New Haven, CT, USA
Chen , Dept. of Comput. Sci., Yale Univ., New Haven, CT, USA
pp. 865-876

A network-topology independent task allocation strategy for parallel computers (Abstract)

Baba , Dept. of Inf. Sci., Utsunomiya Univ., Japan
Iwamoto , Dept. of Inf. Sci., Utsunomiya Univ., Japan
Yoshinaga , Dept. of Inf. Sci., Utsunomiya Univ., Japan
pp. 878-887

A semi distributed task allocation strategy for large hypercube supercomputers (Abstract)

Ghafoor , Syracuse Univ., NY, USA
Ahmad , Syracuse Univ., NY, USA
pp. 898-907

Architecture and implementation of a VLIW supercomputer (Abstract)

Colwell , Multiflow Comput. Inc., Branford, CT, USA
Joshi , Multiflow Comput. Inc., Branford, CT, USA
Tornes , Multiflow Comput. Inc., Branford, CT, USA
Hall , Multiflow Comput. Inc., Branford, CT, USA
Rodman , Multiflow Comput. Inc., Branford, CT, USA
Papworth , Multiflow Comput. Inc., Branford, CT, USA
pp. 910-919

The design of a RISC based multiprocessor chip (Abstract)

Gupta , Dept. of Comput. Sci., Pittsburgh Univ., PA, USA
pp. 920-929

Soviet high-speed computers: the new generation (Abstract)

Goodman , MIS Dept., Arizona Univ., Tucson, AZ, USA
Wolcott , MIS Dept., Arizona Univ., Tucson, AZ, USA
pp. 930-939

Performing data flow analysis in parallel (Abstract)

Ryder , Dept. of Comput. Sci., Rutgers Univ., New Brunswick, NJ, USA
Lee , Dept. of Comput. Sci., Rutgers Univ., New Brunswick, NJ, USA
Marlowe , Dept. of Comput. Sci., Rutgers Univ., New Brunswick, NJ, USA
pp. 942-951

Experience with interprocedural analysis of array side effects (Abstract)

Kennedy , Dept. of Comput. Sci., Rice Univ., Houston, TX, USA
Havlak , Dept. of Comput. Sci., Rice Univ., Houston, TX, USA
pp. 952-961

Subdomain dependence test for massive parallelism (Abstract)

Chen , Dept. of Comput. Sci., Yale Univ., New Haven CT, USA
Lu , Dept. of Comput. Sci., Yale Univ., New Haven CT, USA
pp. 962-972
Author Index

Author Index (PDF)

pp. 973-982
98 ms
(Ver )