This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Roll-Forward and Rollback Recovery: Performance-Reliability Trade-Off
March 1997 (vol. 46 no. 3)
pp. 372-378

Abstract—Performance and reliability trade-offs depend on the recovery scheme used in any fault-tolerant system. Gain in performance, using comparable resources, typically requires sacrifice in reliability, and vice-versa. Roll-forward schemes for duplex systems achieve better performance than rollback schemes, without a significant increase in hardware resource requirements. This paper compares two roll-forward schemes with two roll-back schemes. It is shown that the roll-forward schemes improve performance with only a small loss in reliability as compared to rollback schemes.

[1] P. Agrawal, "Fault Tolerance in Multiprocessor Systems without Dedicated Redundancy," IEEE Trans. Computers, vol. 37, no. 3, pp. 358-362, Mar. 1988.
[2] B.W. Johnson, Design and Analysis of Fault-Tolerant Digital Systems, pp. 394-402. Reading, Mass.: Addison-Wesley, June 1989.
[3] J. Long, W.K. Fuchs, and J.A. Abraham, "Forward Recovery Using Checkpointing in Parallel Systems," Proc. Int'l Conf. Parallel Processing, pp. 272-275, Aug. 1990.
[4] J. Long, W.K. Fuchs, and J.A. Abraham, "Compiler-Assisted Static Checkpoint Insertion," Proc. FTC'92, pp. 58-65, July 1992.
[5] D.K. Pradhan and N.H. Vaidya, "Roll-Forward and Rollback Recovery: Performance-Reliability Trade-Off," Proc. 24th IEEE Int'l Symp. Fault-Tolerant Computing, June 1994.
[6] D.K. Pradhan and N.H. Vaidya, "Roll-Forward Checkpointing Scheme: A Novel Fault-Tolerant Architecture," IEEE Trans. Computers, vol. 43, no. 10, pp. 1,163-1,174, Oct. 1994.
[7] N.H. Vaidya, "Low-Cost Schemes for Fault Tolerance," PhD thesis, Univ. of Massachusetts-Amherst, Feb. 1993.
[8] D.K. Pradhan, "Redundancy Schemes for Recovery," Technical Report TR-89-CSE-16, Electrical and Comp. Eng., Univ. of Massachusetts, Amherst, 1989.
[9] D.K. Pradhan, “Fault-Tolerant Computing: Theory and Techniques,” vol. II, pp. 492-496, chapter 6. Prentice Hall, 1986.
[10] D.K. Pradhan, D.D. Sharma, and N.H. Vaidya, "Roll-Forward Checkpointing Schemes," Hardware and Software Architectures for Fault Tolerance, M. Banatre and P. Lee, eds., Lecture Notes in Computer Science. Springer-Verlag, 1994.

Index Terms:
Checkpointing, duplex systems, performance, reliability, roll-forward.
Citation:
Dhiraj K. Pradhan, Nitin H. Vaidya, "Roll-Forward and Rollback Recovery: Performance-Reliability Trade-Off," IEEE Transactions on Computers, vol. 46, no. 3, pp. 372-378, March 1997, doi:10.1109/12.580435
Usage of this product signifies your acceptance of the Terms of Use.