Second IEEE International Conference on e-Science and Grid Computing (e-Science'06)
Monitoring the Earth System Grid with MDS4
Amsterdam, Netherlands
December 4-6, 2006
ISBN: 0-7695-2734-5
Jennifer M. Schopf, Argonne National Laboratory; The University of Chicago, USA; University of Edinburgh, UK
Mei-Hui Su, University of Southern California, USA
Neill Miller, Argonne National Laboratory; The University of Chicago, USA
In production Grids for scientific applications, service and resource failures must be detected and addressed quickly. In this paper, we describe the monitoring infrastructure used by the Earth System Grid (ESG) project, a scientific collaboration that supports global climate research. ESG uses the Globus Toolkit Monitoring and Discovery System (MDS4) to monitor its resources. We describe how the MDS4 Index Service collects information about ESG resources and how the MDS4 Trigger Service checks specified failure conditions and notifies system administrators when failures occur. We present monitoring statistics for May 2006 and describe our experiences using MDS4 to monitor ESG resources over the last two years.
Citation:
Ann Chervenak, Jennifer M. Schopf, Laura Pearlman, Mei-Hui Su, Shishir Bharathi, Luca Cinquini, Mike D'Arcy, Neill Miller, David Bernholdt, "Monitoring the Earth System Grid with MDS4," Second IEEE International Conference on e-Science and Grid Computing (e-Science'06), p. 69, 2006.