SC Conference (1990)
New York, NY, USA
Nov. 12, 1990 to Nov. 16, 1990
ISBN: 0-8186-2056-0
TABLE OF CONTENTS
Front Matter

Front Matter (PDF)

pp. i-xxv
Papers

LAPACK: A portable linear algebra library for high-performance computers (Abstract)

Angerson , Tennessee Univ., Knoxville, TN, USA
Bai , Tennessee Univ., Knoxville, TN, USA
Dongarra , Tennessee Univ., Knoxville, TN, USA
pp. 2-11

Hierarchical blocking and data flow analysis for numerical linear algebra (Abstract)

Chen , Center for Appl. Math., Cornell Univ., Ithaca, NY, USA
pp. 12-19

Multilinear algebra and parallel programming (Abstract)

Johnson , Dept. of Comput. Sci., City Univ. of New York, NY, USA
pp. 20-31

The impact of memory organization on the performance of matrix multiplication (Abstract)

Hake , Forschungszentrum Juelich GmbH, Germany
Homberg , Forschungszentrum Juelich GmbH, Germany
pp. 34-40

On randomly interleaved memories (Abstract)

Raghavan , Dept. of Electr. Eng. & Comput. Sci., Michigan Univ., Ann Arbor, MI, USA
Hayes , Dept. of Electr. Eng. & Comput. Sci., Michigan Univ., Ann Arbor, MI, USA
pp. 49-58

Tracing application program execution on the Cray X-MP and Cray 2 (Abstract)

Malony , Center for Supercomput. Res. & Dev., Illinois Univ., Urbana, IL, USA
Larson , Center for Supercomput. Res. & Dev., Illinois Univ., Urbana, IL, USA
Reed , Center for Supercomput. Res. & Dev., Illinois Univ., Urbana, IL, USA
pp. 60-73

Parallel program debugging with on-the-fly anomaly detection (Abstract)

Hood , Dept. of Comput. Sci., Rice Univ., Houston, TX, USA
Kennedy , Dept. of Comput. Sci., Rice Univ., Houston, TX, USA
Mellor-Crummey , Dept. of Comput. Sci., Rice Univ., Houston, TX, USA
pp. 74-81

Improving instruction cache behavior by reducing cache pollution (Abstract)

Gupta , North American Philips Corp., Briarcliff Manor, NY, USA
Chi , North American Philips Corp., Briarcliff Manor, NY, USA
pp. 82-91

A parallel Monte Carlo search algorithm for the conformational analysis of proteins (Abstract)

Ripoll , Biotechnol. Res. Inst., Nat. Res. Council of Canada, Montreal, Que., Canada
Thomas , Biotechnol. Res. Inst., Nat. Res. Council of Canada, Montreal, Que., Canada
pp. 94-102

Folding RNA on the Cray-2 (Abstract)

Ess , Cray Comput. Corp., Colorado Springs, CO, USA
pp. 103-111

Experience with a performance analyzer for multithreaded applications (Abstract)

Hansen , CONVEX Comput. Corp., Richardson, TX, USA
Linthicum , CONVEX Comput. Corp., Richardson, TX, USA
Brooks , CONVEX Comput. Corp., Richardson, TX, USA
pp. 124-131

Supercomputer network selection: A case study (Abstract)

French , Eli Lilly & Co., Indianapolis, IN, USA
pp. 154-159

Very high performance networking for supercomputing (Abstract)

Clinger , Solbourne Comput., Longmont, CO, USA
pp. 160-168

Cost-performance analysis of heterogeneity in supercomputer architectures (Abstract)

Menasce , Dept. de Inf., Pontificia Univ. Catolica, Rio de Janeiro, Brazil
pp. 169-177

Fast barrier synchronization hardware (Abstract)

Beckmann , Center for Supercomput. Res. & Dev., Illinois Univ., Urbana, IL, USA
Polychronopoulos , Center for Supercomput. Res. & Dev., Illinois Univ., Urbana, IL, USA
pp. 180-189

Switch-stacks: A scheme for microtasking nested parallel loops (Abstract)

Chow , Center for Supercomput. Res. & Dev., Illinois Univ., Urbana, IL, USA
Harrison , Center for Supercomput. Res. & Dev., Illinois Univ., Urbana, IL, USA
pp. 190-199

Parallelization of loops with exits on pipelined architectures (Abstract)

Tirumalai , Hewlett-Packard Lab., Palo Alto, CA, USA
Lee , Hewlett-Packard Lab., Palo Alto, CA, USA
Schlansker , Hewlett-Packard Lab., Palo Alto, CA, USA
pp. 200-212

Computation of large scale constrained matrix problems: the splitting equilibration algorithm (Abstract)

Nagurney , Massachusetts Univ., Amherst, MA, USA
Eydeland , Massachusetts Univ., Amherst, MA, USA
Kim , Massachusetts Univ., Amherst, MA, USA
pp. 214-223

High performance preconditioning on supercomputers for the 3D device simulator MINIMOS (Abstract)

Traar , SIEMENS AG Osterreich, Vienna, Austria
Mader , SIEMENS AG Osterreich, Vienna, Austria
pp. 224-231

Fault-tolerant routing in MIN-based supercomputers (Abstract)

Chalasani , Dept. of Electr. Eng-Syst., Univ. of Southern California, Los Angeles, CA, USA
Raghavendra , Dept. of Electr. Eng-Syst., Univ. of Southern California, Los Angeles, CA, USA
pp. 244-253

Uni-directional hypercubes (Abstract)

Chou , Dept. of Comput. Sci., Minnesota Univ., Minneapolis, MN, USA
Du , Dept. of Comput. Sci., Minnesota Univ., Minneapolis, MN, USA
pp. 254-263

Design and analysis of buffered crossbars and banyans with cut-through switching (Abstract)

Szymanski , Dept. of Electr. Eng., Columbia Univ., New York, NY, USA
Fang , Dept. of Electr. Eng., Columbia Univ., New York, NY, USA
pp. 264-273

A parallel object-oriented total architecture: A-NET (Abstract)

Baba , Dept. of Inf. Sci., Utsunomiya Univ., Japan
Yoshinaga , Dept. of Inf. Sci., Utsunomiya Univ., Japan
Iijima , Dept. of Inf. Sci., Utsunomiya Univ., Japan
Iwamoto , Dept. of Inf. Sci., Utsunomiya Univ., Japan
Hamada , Dept. of Inf. Sci., Utsunomiya Univ., Japan
Suzuki , Dept. of Inf. Sci., Utsunomiya Univ., Japan
pp. 276-285

A parallel computer model supporting procedure-based communication (Abstract)

Zhou , Allied-Signal Aerosp. Technol. Center, Columbia, MD, USA
pp. 286-294

Performance estimation in a massively parallel system (Abstract)

Agrawal , AT&T Bell Lab., Murray Hill, NJ, USA
pp. 306-313

Parameterized algorithm decomposition and performance analysis (Abstract)

Harkin , Dept. of Comput. Sci., Montana State Univ., Bozeman, MT, USA
pp. 314-323

Another view on parallel speedup (Abstract)

Sun , Dept. of Comput. Sci., Michigan State Univ., East Lansing, MI, USA
Ni , Dept. of Comput. Sci., Michigan State Univ., East Lansing, MI, USA
pp. 324-333

A taxonomy of concepts for evaluating chess strength (Abstract)

Berliner , Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
pp. 336-343

Chess and supercomputers: details about optimizing Cray Blitz (Abstract)

Hyatt , Alabama Univ., Birmingham, AL, USA
pp. 354-363

Experiences in building the Clemson Computational Sciences Program (Abstract)

Stevenson , Clemson Univ., SC, USA
Panoff , Clemson Univ., SC, USA
pp. 366-375

A real introduction to supercomputing: a user training course (Abstract)

Hanson , Dept. of Math., Stat. & Comput. Sci., Illinois Univ., Chicago, IL, USA
pp. 376-385

Loop displacement: an approach for transforming and scheduling loops for parallel execution (Abstract)

Gupta , Dept. of Comput. Sci., Pittsburgh Univ., PA, USA
pp. 388-397

A compiler-assisted approach to SPMD execution (Abstract)

Cytron , IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
pp. 398-406

Loop distribution with arbitrary control flow (Abstract)

Kennedy , Dept. of Comput. Sci., Rice Univ., Houston, TX, USA
McKinley , Dept. of Comput. Sci., Rice Univ., Houston, TX, USA
pp. 407-416

Large scale computing on clustered vector multiprocessors (Abstract)

Kamel , IBM Sci. Center, Bergen, Norway
pp. 418-427

Perfect Benchmarks decomposition and performance on VAX multiprocessors (Abstract)

Cvetanovic , Digital Equipment Corp., Maynard, MA, USA
Freedman , Digital Equipment Corp., Maynard, MA, USA
Nofsinger , Digital Equipment Corp., Maynard, MA, USA
pp. 455-464

Efficient decomposition and performance of parallel PDE, FFT, Monte Carlo simulations, simplex, and sparse solvers (Abstract)

Cvetanovic , Digital Equipment Corp., Maynard, MA, USA
Freedman , Digital Equipment Corp., Maynard, MA, USA
Nofsinger , Digital Equipment Corp., Maynard, MA, USA
pp. 465-474

Embedding meshes on the star graph (Abstract)

Ranka , Sch. of Comput. & Inf. Sci., Syracuse Univ., NY, USA
Wang , Sch. of Comput. & Inf. Sci., Syracuse Univ., NY, USA
Yeh , Sch. of Comput. & Inf. Sci., Syracuse Univ., NY, USA
pp. 476-485

Logarithmic time cost optimal parallel sorting is not yet fast in practice (Abstract)

Natvig , Norwegian Inst. of Technol., Trondheim Univ., Norway
pp. 486-494

A simple and correct shared-queue algorithm using compare-and-swap (Abstract)

Stone , IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
pp. 495-504

Fine-grain parallelism in the ALPS programming language (Abstract)

Vishnubhotla , Dept. of Comput. & Inf. Sci., Ohio State Univ., Columbus, OH, USA
pp. 506-514

Delirium: an embedding coordination language (Abstract)

Lucco , Dept. of Comput. Sci., California Univ., Berkeley, CA, USA
Sharp , Dept. of Comput. Sci., California Univ., Berkeley, CA, USA
pp. 515-524

UC: a language for the connection machine (Abstract)

Bagrodia , Dept. of Comput. Sci., California Univ., Los Angeles, CA, USA
pp. 525-534

Parallel algorithm research at CERFACS (Abstract)

Duff , Rutherford Appleton Lab., Chilton, UK
pp. 536-542

A write update cache coherence protocol for MIN-based multiprocessors with accessibility-based split caches (Abstract)

Algudady , Dept. of Electr. & Comput. Eng., Pennsylvania State Univ., University Park, PA, USA
Das , Dept. of Electr. & Comput. Eng., Pennsylvania State Univ., University Park, PA, USA
Thazhuthaveetil , Dept. of Electr. & Comput. Eng., Pennsylvania State Univ., University Park, PA, USA
pp. 544-553

Cache coherence in systems with parallel communication channels and many processors (Abstract)

Willis , Philips Lab., Briarcliff Manor, NY, USA
Sanderson , Philips Lab., Briarcliff Manor, NY, USA
Hill , Philips Lab., Briarcliff Manor, NY, USA
pp. 554-563

Data cache performance of supercomputer applications (Abstract)

Callahan , Tera Comput. Co., Seattle, WA, USA
Porterfield , Tera Comput. Co., Seattle, WA, USA
pp. 564-572

Resource binding-a universal approach to parallel programming (Abstract)

Shing , Dept. of Comput. Sci., Michigan State Univ., East Lansing, MI, USA
Ni , Dept. of Comput. Sci., Michigan State Univ., East Lansing, MI, USA
pp. 574-583

A flexible communication abstraction for nonshared memory parallel computing (Abstract)

Alverson , Dept. of Comput. Sci. & Eng., Washington Univ., Seattle, WA, USA
Griswold , Dept. of Comput. Sci. & Eng., Washington Univ., Seattle, WA, USA
Notkin , Dept. of Comput. Sci. & Eng., Washington Univ., Seattle, WA, USA
Snyder , Dept. of Comput. Sci. & Eng., Washington Univ., Seattle, WA, USA
pp. 584-593

Implementation machine paradigm for parallel programming (Abstract)

Rao , Carnegie Mellon Univ., Pittsburgh, PA, USA
Segall , Carnegie Mellon Univ., Pittsburgh, PA, USA
Vrsalovic , Carnegie Mellon Univ., Pittsburgh, PA, USA
pp. 594-603

Efficient parallel logic simulation techniques for the Connection Machine (Abstract)

Chung , Dept. of Comput. Sci., Michigan State Univ., East Lansing, MI, USA
Chung , Dept. of Comput. Sci., Michigan State Univ., East Lansing, MI, USA
pp. 606-614

Design of a scalable parallel switch-level simulator for VLSI (Abstract)

Mueller-Thuns , Coordinated Sci. Lab., Illinois Univ., Urbana, IL, USA
Saab , Coordinated Sci. Lab., Illinois Univ., Urbana, IL, USA
pp. 615-624

SISAL versus Fortran: a comparison using the Livermore Loops (Abstract)

Cann , Lawrence Livermore Nat. Lab., CA, USA
Feo , Lawrence Livermore Nat. Lab., CA, USA
pp. 626-636

Experimental analysis of communication/data-conditional aspects of a mixed-mode parallel architecture via synthetic computations (Abstract)

Fineberg , Dept. of Electr. & Comput. Eng., Iowa Univ., Iowa City, IA, USA
Casavant , Dept. of Electr. & Comput. Eng., Iowa Univ., Iowa City, IA, USA
pp. 637-646

Performance evaluation of mesh-connected wormhole-routed networks for interprocessor communication in multicomputers (Abstract)

Chittor , Dept. of Comput. Sci., Michigan State Univ., East Lansing, MI, USA
Enbody , Dept. of Comput. Sci., Michigan State Univ., East Lansing, MI, USA
pp. 647-656

Theorem proving in propositional logic on vector computers using a generalized Davis-Putnam procedure (Abstract)

Wen-Tsuen Chen , Inst. of Comput. Sci., Nat. Tsing Hua Univ., Hsinchu, Taiwan
Ming-Yi Fang , Inst. of Comput. Sci., Nat. Tsing Hua Univ., Hsinchu, Taiwan
pp. 658-665

Scan primitives for vector computers (Abstract)

Chatterjee , Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
Blelloch , Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
Zagha , Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
pp. 666-675

A vectorized long-period shift-register random number generator (Abstract)

Filippone , IBM Eur. Center for Sci. & Eng. Comput., Rome, Italy
Santangelo , IBM Eur. Center for Sci. & Eng. Comput., Rome, Italy
Vitaletti , IBM Eur. Center for Sci. & Eng. Comput., Rome, Italy
pp. 676-684

Massively parallel computational methods in light scattering by small particles (Abstract)

Potter , Dept. of Electr. & Comput. Eng., Clarkson Univ., Potsdam, NY, USA
Cline , Dept. of Electr. & Comput. Eng., Clarkson Univ., Potsdam, NY, USA
pp. 686-692

MONT3E: A Monte Carlo electron heat transfer code (Abstract)

Maltby , Lawrence Livermore Nat. Lab., CA, USA
Kornblum , Lawrence Livermore Nat. Lab., CA, USA
pp. 700-707

A fiber optic hypermesh for SIMD/MIMD machines (Abstract)

Szymanski , Dept. of Electr. Eng., Columbia Univ., New York, NY, USA
pp. 710-719

A message passing coprocessor for distributed memory multicomputers (Abstract)

Hsu , Coordinated Sci. Lab., Illinois Univ., Urbana, IL, USA
Banerjee , Coordinated Sci. Lab., Illinois Univ., Urbana, IL, USA
pp. 720-729

Architectural support for register allocation in the presence of aliasing (Abstract)

Heggy , Dept. of Comput. Sci., Pittsburgh Univ., PA, USA
Soffa , Dept. of Comput. Sci., Pittsburgh Univ., PA, USA
pp. 730-739

An optimal hypercube direct N-body solver on the Connection Machine (Abstract)

Brunet , Thinking Machines Corp., Cambridge, MA, USA
Mesirov , Thinking Machines Corp., Cambridge, MA, USA
pp. 748-752

P3D: A Lisp-based format for representing general 3D models (Abstract)

Welling , Pittsburgh Supercomput. Center, PA, USA
pp. 766-774

Scientific data visualization: a formal introduction to the rendering and geometric modeling aspects (Abstract)

Harrand , Alabama Univ., Huntsville, AL, USA
Choudry , Alabama Univ., Huntsville, AL, USA
Ziebarth , Alabama Univ., Huntsville, AL, USA
pp. 775-783

Run-time monitoring of concurrent programs on the Cedar multiprocessor (Abstract)

Sharma , Center for Supercomput. Res. & Dev., Illinois Univ., Urbana, IL, USA
Malony , Center for Supercomput. Res. & Dev., Illinois Univ., Urbana, IL, USA
Berry , Center for Supercomput. Res. & Dev., Illinois Univ., Urbana, IL, USA
Sinvhal-Sharma , Center for Supercomput. Res. & Dev., Illinois Univ., Urbana, IL, USA
pp. 784-793

Future general purpose supercomputer architectures (Abstract)

Smith , Cray Res. Inc., Chippewa Falls, WI, USA
Hsu , Cray Res. Inc., Chippewa Falls, WI, USA
Hsiung , Cray Res. Inc., Chippewa Falls, WI, USA
pp. 796-804

Building black holes, gravitational waves, and relativistic fluid flows: supercomputer cinema (Abstract)

Shapiro , Cornell Univ., Ithaca, NY, USA
Teukolsky , Cornell Univ., Ithaca, NY, USA
pp. 805-814

Quantum molecular modeling with simulated annealing-A distributed processing and visualization application (Abstract)

Hohl , Nat. Center for Supercomput. Applications, Illinois Univ., Urbana, IL, USA
Idaszak , Nat. Center for Supercomput. Applications, Illinois Univ., Urbana, IL, USA
pp. 816-825

Time dilation visualization in relativity (Abstract)

Hsiung , Carnegie Mellon Univ., Pittsburgh, PA, USA
Thibadeau , Carnegie Mellon Univ., Pittsburgh, PA, USA
Cox , Carnegie Mellon Univ., Pittsburgh, PA, USA
Dunn , Carnegie Mellon Univ., Pittsburgh, PA, USA
pp. 835-844

Partitioning declarative programs into communicating processes (Abstract)

Roy , Dept. of Inf. & Comput. Sci., California Univ., Irvine, CA, USA
Nagel , Dept. of Inf. & Comput. Sci., California Univ., Irvine, CA, USA
Bic , Dept. of Inf. & Comput. Sci., California Univ., Irvine, CA, USA
pp. 846-855

Parallel processing of near fine grain tasks using static scheduling on OSCAR (optimally scheduled advanced multiprocessor) (Abstract)

Kasahara , Dept. of Electr. Eng., Waseda Univ., Tokyo, Japan
Honda , Dept. of Electr. Eng., Waseda Univ., Tokyo, Japan
Narita , Dept. of Electr. Eng., Waseda Univ., Tokyo, Japan
pp. 856-864

Generating explicit communication from shared-memory program references (Abstract)

Li , Dept. of Comput. Sci., Yale Univ., New Haven, CT, USA
Chen , Dept. of Comput. Sci., Yale Univ., New Haven, CT, USA
pp. 865-876

A network-topology independent task allocation strategy for parallel computers (Abstract)

Baba , Dept. of Inf. Sci., Utsunomiya Univ., Japan
Iwamoto , Dept. of Inf. Sci., Utsunomiya Univ., Japan
Yoshinaga , Dept. of Inf. Sci., Utsunomiya Univ., Japan
pp. 878-887

A semi distributed task allocation strategy for large hypercube supercomputers (Abstract)

Ahmad , Syracuse Univ., NY, USA
Ghafoor , Syracuse Univ., NY, USA
pp. 898-907

Architecture and implementation of a VLIW supercomputer (Abstract)

Colwell , Multiflow Comput. Inc., Branford, CT, USA
Hall , Multiflow Comput. Inc., Branford, CT, USA
Joshi , Multiflow Comput. Inc., Branford, CT, USA
Papworth , Multiflow Comput. Inc., Branford, CT, USA
Rodman , Multiflow Comput. Inc., Branford, CT, USA
Tornes , Multiflow Comput. Inc., Branford, CT, USA
pp. 910-919

The design of a RISC based multiprocessor chip (Abstract)

Gupta , Dept. of Comput. Sci., Pittsburgh Univ., PA, USA
pp. 920-929

Soviet high-speed computers: the new generation (Abstract)

Wolcott , MIS Dept., Arizona Univ., Tucson, AZ, USA
Goodman , MIS Dept., Arizona Univ., Tucson, AZ, USA
pp. 930-939

Performing data flow analysis in parallel (Abstract)

Lee , Dept. of Comput. Sci., Rutgers Univ., New Brunswick, NJ, USA
Marlowe , Dept. of Comput. Sci., Rutgers Univ., New Brunswick, NJ, USA
Ryder , Dept. of Comput. Sci., Rutgers Univ., New Brunswick, NJ, USA
pp. 942-951

Experience with interprocedural analysis of array side effects (Abstract)

Havlak , Dept. of Comput. Sci., Rice Univ., Houston, TX, USA
Kennedy , Dept. of Comput. Sci., Rice Univ., Houston, TX, USA
pp. 952-961

Subdomain dependence test for massive parallelism (Abstract)

Lu , Dept. of Comput. Sci., Yale Univ., New Haven, CT, USA
Chen , Dept. of Comput. Sci., Yale Univ., New Haven, CT, USA
pp. 962-972
Author Index

Author Index (PDF)

pp. 973-982