Fourth IEEE International Conference on Cluster Computing (CLUSTER'02)
An Extensible, Portable, Scalable Cluster Management Software Architecture
Chicago, Illinois
September 23-September 26
ISBN: 0-7695-1745-5
This paper describes an object-oriented software architecture for cluster integration and management that enables extensibility, portability, and scalability. This architecture has been successfully implemented and deployed on several large-scale production clusters at Sandia National Laboratories, the largest of which is currently 1861 nodes. This paper discusses the key features of the architecture that allow for easily extending the range of supported hardware devices and network topologies. We also describe in detail how the object-oriented structure that represents the hardware components can be used to implement scalable and portable cluster management tools.
Index Terms:
Cluster Management, Commodity Clusters, Cluster Architecture, Commercial Off The Shelf (COTS)
Citation:
James H. Laros III, Lee Ward, Nathan W. Dauchy, Ron Brightwell, Trammell Hudson, Ruth Klundt, "An Extensible, Portable, Scalable Cluster Management Software Architecture," cluster, pp.287, Fourth IEEE International Conference on Cluster Computing (CLUSTER'02), 2002