Issue No. 11 - November (1998 vol. 9)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/71.735957
Abstract: Harnessing the computational capabilities of a network of workstations promises to off-load work from overloaded supercomputers onto resources that sit largely idle overnight. Several capabilities are needed to do this, including support for an architecture-independent parallel programming environment, task migration, automatic resource allocation, and fault tolerance. The Hector distributed run-time environment is designed to present these capabilities transparently to programmers. MPI programs can be run under this environment on homogeneous clusters without any modification to their source code. The design of Hector, its internal structure, and several benchmarks and tests are presented.
Keywords: Parallel computing, load balancing, fault tolerance, resource allocation, task migration.
Jonathan Robinson, Brian K. Flachs, Samuel H. Russ, "The Hector Distributed Run-Time Environment", IEEE Transactions on Parallel & Distributed Systems, vol. 9, no. 11, pp. 1102-1114, November 1998, doi:10.1109/71.735957