Issue No. 09 - September (2006 vol. 7)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/MDSO.2006.56
Raphael Y. de Camargo , University of Sao Paulo
Fabio Kon , University of Sao Paulo
Renato Cerqueira , Pontifical Catholic University of Rio de Janeiro
This article compares several strategies for storing checkpoint data from parallel applications in an opportunistic grid environment. In terms of computational overhead, storage overhead, and degree of fault tolerance, the authors evaluate the use of replication, parity information, and erasure coding. They use an object-oriented grid middleware solution called InteGrade to implement these strategies and to perform the evaluation experiments.
fault tolerance, distributed storage, data coding, checkpointing, grid computing
R. Cerqueira, R. Y. de Camargo and F. Kon, "Strategies for Checkpoint Storage on Opportunistic Grids," in IEEE Distributed Systems Online, vol. 7, no. , pp. 1, 2006.