14th International Conference on Distributed Computing Systems (1994)
June 21, 1994 to June 24, 1994
A. Arora , Dept. of Comput. Sci., Ohio State Univ., Columbus, OH, USA
We present a method for the design of nonmasking fault-tolerant programs. In our method, a set of constraints is associated with each program. Each of these constraints is continually satisfied under the execution of program actions, as long as faults do not occur. Whenever some of the constraints are violated, due to certain faults, all constraints are eventually reestablished by subsequent execution of the program actions. To design programs thus, two types of program actions are distinguished: "closure" actions and "convergence" actions. Closure actions are the actions that perform the intended computation of the program when all of the constraints are satisfied. Convergence actions are the actions that reestablish the constraints when they have been violated. Sufficient conditions for the validation of closure and convergence actions are formalized in terms of a "constraint graph". These conditions are illustrated by designing nonmasking fault-tolerant programs for diffusing computations, atomic actions, and token rings.<
fault tolerant computing, software reliability, program testing, constraint handling
A. Arora, M. Gouda and G. Varghese, "Constraint satisfaction as a basis for designing nonmasking fault-tolerance," 14th International Conference on Distributed Computing Systems(ICDCS), Pozman, Poland, 1995, pp. 424-431.