Issue No.06 - June (1997 vol.8)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/71.595580
<p><b>Abstract</b>: Consistent global checkpoints have many uses in distributed computations. A central question in applications that use them is whether a consistent global checkpoint that includes a given set of local checkpoints can exist. Netzer and Xu [16] presented the necessary and sufficient conditions under which such a consistent global checkpoint can exist, but they did not explore which checkpoints could be used to construct it. In this paper, we prove exactly which local checkpoints can be used for constructing such consistent global checkpoints. We illustrate the use of our results with a simple and elegant algorithm to enumerate all such consistent global checkpoints.</p>
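To make the question in the abstract concrete, the following is a minimal Python sketch. It is not the algorithm from the paper: it uses the standard definition of consistency (no message is recorded as received without also being recorded as sent) and a brute-force search over all combinations. The message encoding and the names `is_consistent` and `consistent_extensions` are assumptions made purely for illustration.

```python
from itertools import product

# Assumed encoding (hypothetical, for illustration only):
# a message is (sender, send_interval, receiver, recv_interval), where
# interval k of a process is the span after its checkpoint k and before
# its checkpoint k+1. A global checkpoint ("cut") is a tuple holding one
# checkpoint index per process.

def is_consistent(cut, messages):
    """Return True if no message is received before the receiver's
    checkpoint while being sent after the sender's checkpoint."""
    for sender, s_int, receiver, r_int in messages:
        sent_after = s_int >= cut[sender]
        received_before = r_int < cut[receiver]
        if sent_after and received_before:
            return False  # orphan message: receipt recorded, send not
    return True

def consistent_extensions(fixed, num_checkpoints, messages):
    """Brute-force enumeration of all consistent global checkpoints that
    contain the local checkpoints in `fixed` (process -> checkpoint index)."""
    choices = [
        [fixed[p]] if p in fixed else range(n)
        for p, n in enumerate(num_checkpoints)
    ]
    return [cut for cut in product(*choices) if is_consistent(cut, messages)]

# Two processes with checkpoints 0..2 each. Process 0 sends a message
# after its checkpoint 1; process 1 receives it before its checkpoint 1.
messages = [(0, 1, 1, 0)]
print(consistent_extensions({0: 1}, [3, 3], messages))
print(consistent_extensions({1: 1}, [3, 3], messages))
```

The brute-force search is exponential in the number of processes, so it only serves to show what "a consistent global checkpoint that includes a given set of local checkpoints" means on a small example.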
Keywords: Causality, distributed checkpointing, consistent global states, failure recovery, fault tolerance.
D. Manivannan, Robert H. B. Netzer, Mukesh Singhal, "Finding Consistent Global Checkpoints in a Distributed Computation", IEEE Transactions on Parallel & Distributed Systems, vol. 8, no. 6, pp. 623-627, June 1997, doi:10.1109/71.595580