Issue No.10 - October (1997 vol.46)
pp: 1137-1141
ABSTRACT
A method of execution retry for bypassing software faults in message-passing applications is described in this paper. Based on the techniques of checkpointing and message logging, we demonstrate the use of message replaying and message reordering as two mechanisms for achieving localized and fast recovery. The approach gradually increases the rollback distance and the number of affected processes when a previous retry fails, and is therefore named progressive retry. Examples from telecommunications software systems and performance measurements from an application-level implementation are described to illustrate the benefits of the scheme.
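The escalation idea in the abstract can be illustrated with a small, heavily simplified sketch. It is not the paper's algorithm: it models a single process with an in-memory message log, and the names `progressive_retry`, `process`, and `no_b_then_c` are hypothetical. For each rollback distance it first replays the undone messages in their original order, then tries reorderings, and widens the rollback only when every attempt at the current distance fails.

```python
from itertools import permutations


def progressive_retry(log, process, max_distance=None):
    """Retry with growing rollback distance (illustrative assumption, not the paper's code).

    log      -- messages delivered since the last checkpoint, oldest first
    process  -- hypothetical callable: returns True if the sequence runs without failure
    Returns (distance, delivered_order) for the first successful retry, else None.
    """
    limit = len(log) if max_distance is None else min(max_distance, len(log))
    for distance in range(1, limit + 1):
        # Undo only the last `distance` messages; keep the earlier ones fixed.
        prefix, tail = log[:len(log) - distance], log[len(log) - distance:]
        # permutations() yields the original order first (plain replay),
        # followed by every reordering of the undone messages.
        for order in permutations(tail):
            candidate = list(prefix) + list(order)
            if process(candidate):
                return distance, candidate
    return None  # every local retry failed; a wider-scope recovery would be needed


def no_b_then_c(seq):
    """Toy fault model: the process fails when "B" is delivered right before "C"."""
    return all(not (x == "B" and y == "C") for x, y in zip(seq, seq[1:]))


result = progressive_retry(["A", "B", "C"], no_b_then_c)
print(result)
```

In this toy run, rolling back one message cannot avoid the failing order, so the sketch escalates to a distance of two and succeeds by delivering "C" before "B". With a deterministic `process`, plain replay always fails the same way; in a real system replay can succeed because the environment may behave differently on the second attempt.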
INDEX TERMS
Fault tolerance, distributed systems, protocols, checkpointing, logging, rollback recovery, message reordering, recovery escalation, telecommunication systems.
CITATION
Yi-Min Wang, Yennun Huang, W. Kent Fuchs, Chandra Kintala, Gaurav Suri, "Progressive Retry for Software Failure Recovery in Message-Passing Applications", IEEE Transactions on Computers, vol.46, no. 10, pp. 1137-1141, October 1997, doi:10.1109/12.628398