Structuring Distributed Systems for Recoverability and Crash Resistance
July 1981 (vol. 7 no. 4)
pp. 436-447
S.K. Shrivastava, Computing Laboratory, University of Newcastle-upon-Tyne
An object-oriented multilevel model of computation is used to discuss recoverability and crash resistance issues in distributed systems. Of particular importance are the issues that are raised when recoverability and crash resistance properties are desired from objects whose concrete representations are distributed over several nodes. The execution of a program at a node of the system can give rise to a hierarchy of processes executing various parts of the program at different nodes. Recoverability and crash resistance properties are needed to ensure that such a group of processes leaves the system state consistent despite faults in the system.
Atomic actions, backward error recovery, commitment, concurrency, consistency, crash resistance, distributed systems, exception handling, message passing, recoverability, secure storage
S.K. Shrivastava, "Structuring Distributed Systems for Recoverability and Crash Resistance," IEEE Transactions on Software Engineering, vol. 7, no. 4, pp. 436-447, July 1981, doi:10.1109/TSE.1981.230846
