Issue No. 04 - October-December (2005 vol. 2)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TDSC.2005.42
Communication-induced checkpointing (CIC) protocols can be used to prevent the domino effect. Among them, protocols in the index-based category have been shown to perform better. In this paper, we propose an efficient index-based CIC protocol. The fully informed (FI) protocol proposed in the literature is regarded as the best index-based CIC protocol achievable in practice, because an optimal protocol would need information about the future. We find that the enhancement adopted by FI rarely takes effect in practice. By discarding this enhancement, we obtain a new protocol, called NMMP. Simulation results show that our protocol is almost as efficient as FI in some typical computational environments. In particular, we demonstrate that the two protocols behave identically over a tree communication network. Notably, NMMP needs to piggyback only constant-size control information on each message, regardless of the number of processes.
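To make the notion of an index-based protocol with constant-size piggybacked information concrete, the following is a minimal sketch of the classic index-based scheme usually attributed to Briatico, Ciuffoletti, and Simoncini. It is not NMMP or FI, whose rules are not given in this abstract. Each process keeps one integer index, attaches it to every outgoing message, and takes a forced checkpoint before delivering a message that carries a larger index. The names `Process`, `basic_checkpoint`, `send`, and `receive` are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Process:
    """One process in a classic index-based CIC scheme (illustrative only, not NMMP)."""
    pid: int
    sn: int = 0  # local checkpoint index; the only control data piggybacked on messages
    checkpoints: list = field(default_factory=lambda: [("initial", 0)])

    def basic_checkpoint(self):
        # A checkpoint taken on the process's own initiative advances the index.
        self.sn += 1
        self.checkpoints.append(("basic", self.sn))

    def send(self, payload):
        # Piggyback a single integer, independent of the number of processes.
        return (payload, self.sn)

    def receive(self, message):
        payload, msg_sn = message
        # Forced checkpoint before delivery if the sender's index is ahead,
        # so that checkpoints with the same index stay mutually consistent.
        if msg_sn > self.sn:
            self.sn = msg_sn
            self.checkpoints.append(("forced", self.sn))
        return payload
```

In this sketch, a process that receives a message with an index no larger than its own delivers it without checkpointing, which is why the per-message overhead stays at one integer.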
Index Terms: Distributed systems, fault tolerance, domino effect, communication-induced checkpointing, index-based protocols.
Jichiang Tsai, "An Efficient Index-Based Checkpointing Protocol with Constant-Size Control Information on Messages", IEEE Transactions on Dependable and Secure Computing, vol. 2, no. 4, pp. 287-296, October-December 2005, doi:10.1109/TDSC.2005.42