Issue No. 01 - Jan. (1986 vol. 12)
Roy H. Campbell , Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana. IL 61801
Two complementary techniques have evolved for providing fault-tolerance in software: forward error recovery and backward error recovery. Few implementations permit both approaches to be combined within a particular application. Fewer techniques are available for the construction of fault-tolerant software for systems involving concurrent processes and multiple processors. Many schemes for supporting forward or backward recovery are based on some concept of an atomic action. In this paper, we propose a mechanism for supporting an atomic action in a system of Communicating Sequential Processes (CSP). The atomic action is used as the basic unit for providing fault-tolerance. The atomic action is called an FT-Action, and both forward and backward error recovery are performed in the context of an FT-Action. An implementation for the FT-Action is proposed, which employs a distributed control, uses CSP primitives, and supports local compile and runtime checking of the forward and backward error recovery schemes.
Fault tolerance, Fault tolerant systems, Software, Computer languages, Process control, Synchronization, software fault-tolerance, Atomic actions, backward recovery, communicating sequential processes, forward recovery
R. H. Campbell, "Atomic actions for fault-tolerance using CSP," in IEEE Transactions on Software Engineering, vol. 12, no. , pp. 59-68, 1986.