The Community for Technology Leaders
SC Conference (1999)
Portland, Oregon, USA
Nov. 13, 1999 to Nov. 18, 1999
ISBN: 1-58113-091-0
pp: 5
G Mahinthakumar , Oak Ridge National Laboratory
Nicholas T Karonis , Northern Illinois University
Forrest M Hoffman , Oak Ridge National Laboratory
William W Hargrove , University of Tennessee
The authors present a metacomputing application of multivariate, nonhierarchical statistical clustering to geographic environmental data from the 48 conterminous United States in order to produce maps of regions of ecological similarity, called ecoregions. These maps represent finer scale regionalizations than do those generated by the traditional technique: an expert with a marker pen. Several variables (e.g., temperature, organic matter, rainfall etc.) thought to affect the growth of vegetation are clustered at resolutions as fine as one square kilometer (1 km<sup>2</sup>). These data can represent over 7.8 million map cells in an n-dimensional (n = 9 to 25) data space. A parallel version of the iterative statistical clustering algorithm is developed by the authors using the MPI (Message Passing Interface) message passing routines. The parallel algorithm uses a classical, self-scheduling, single-program, multiple data (SPMD) organization; performs dynamic load balancing for reasonable performance in heterogeneous metacomputing environments; and provides fault tolerance by saving intermediate results for easy restarts in case of hardware failure. The parallel algorithm was tested on various geographically distributed heterogeneous metacomputing configurations involving an IBM SP3<sup>TM</sup>, an IBM SP2<sup>TM</sup>, and two SGI Origin 2000<sup>TM</sup> 's. The tests were performed with minimal code modification, and were made possible by Globus<sup>TM</sup> (a metacomputing software toolkit) and the Globus-enabled version of MPI (MPICH-G). Our performance tests indicate that while the algorithm works reasonably well under the metacomputing environment for a moderate number of processors, the communication overhead can become prohibitive for large processor configurations.
G Mahinthakumar, Nicholas T Karonis, Forrest M Hoffman, William W Hargrove, "Multivariate Geographic Clustering in A Metacomputing Environment Using Globus", SC Conference, vol. 00, no. , pp. 5, 1999, doi:10.1109/SC.1999.10014
100 ms
(Ver 3.3 (11022016))