Issue No.05 - May (1998 vol.47)
pp: 603-613
ABSTRACT
Conventional schemes of rollback recovery with checkpointing for concurrent processes have overlooked an important problem: contamination of checkpoints as a result of error propagation among the cooperating processes. Error propagation is unavoidable due to imperfect detection mechanisms and random interprocess communications, and it could give rise to contaminated checkpoints which, in turn, result in unsuccessful rollbacks. To counter the problem of error propagation, a damage assessment model is developed to estimate the correctness of saved checkpoints under various circumstances. Using the result of damage assessment, determination of the "optimal" checkpoints for rollback recovery (which minimize the average total recovery overhead) is formulated and solved as a nonlinear integer programming problem. Integration of damage assessment into existing recovery schemes is also discussed.
INDEX TERMS
Damage assessment, error propagation, rollback recovery, checkpointing, nonlinear integer programming.
CITATION
Tein-Hsiang Lin, Kang G. Shin, "Damage Assessment for Optimal Rollback Recovery", IEEE Transactions on Computers, vol.47, no. 5, pp. 603-613, May 1998, doi:10.1109/12.677255