Issue No.03 - May (1983 vol.9)
D. Skeen , Department of Computer Science, Cornell University
A formal model for atomic commit protocols for a distributed database system is introduced. The model is used to prove existence results about resilient protocols for site failures that do not partition the network and then for partitioned networks. For site failures, a pessimistic recovery technique, called independent recovery, is introduced and the class of failures for which resilient protocols exist is identified. For partitioned networks, two cases are studied: the pessimistic case in which messages are lost, and the optimistic case in which no messages are lost. In all cases, fundamental limitations on the resiliency of protocols are derived.
transaction management, Commit protocols, crash recovery, distributed database systems, distributed systems, fault tolerance
D. Skeen, M. Stonebraker, "A Formal Model of Crash Recovery in a Distributed System", IEEE Transactions on Software Engineering, vol.9, no. 3, pp. 219-228, May 1983, doi:10.1109/TSE.1983.236608