This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Atomic Broadcast in Asynchronous Crash-Recovery Distributed Systems and Its Use in Quorum-Based Replication
September/October 2003 (vol. 15 no. 5)
pp. 1206-1217

AbstractAtomic Broadcast is a fundamental problem of distributed systems: It states that messages must be delivered in the same order to their destination processes. This paper describes a solution to this problem in asynchronous distributed systems in which processes can crash and recover. A Consensus-based solution to Atomic Broadcast problem has been designed by Chandra and Toueg for asynchronous distributed systems where crashed processes do not recover. We extend this approach: It transforms any Consensus protocol suited to the crash-recovery model into an Atomic Broadcast protocol suited to the same model. We show that Atomic Broadcast can be implemented requiring few additional log operations in excess of those required by the Consensus. The paper also discusses how additional log operations can improve the protocol in terms of faster recovery and better throughput. To illustrate the use of the protocol, the paper also describes a solution to the replica management problem in asynchronous distributed systems in which processes can crash and recover. The proposed technique makes a bridge between established results on Weighted Voting and recent results on the Consensus problem.

[1] M. Aguilera, W. Chen, and S. Toueg, Failure Detection and Consensus in the Crash-Recovery Model Distributed Computing, vol. 13, no. 2, pp. 99-125, 2000.
[2] Y. Amir, Replication Using Group Communication over a Partitioned Network PhD thesis, Hebrew Univ. of Jerusalem, 1995.
[3] H. Attiya and J. Welch, “Sequential Consistency versus Linearizability,” ACM Trans. Computer Systems, May 1994.
[4] Ö. ${\rm Babao\breve{g}lu}$ and S. Toueg, "Non-Blocking Atomic Commitment," Distributed Systems, S. Mullender, ed., ACM Press, New York, 1993, pp. 147-166.
[5] P. Bernstein, V. Hadzilacos, and N. Goodman, Concurrency Control and Recovery in Database Systems. Addison-Wesley, 1987.
[6] K.P. Birman and R. Van Renesse, Reliable Distributed Computing with the Isis Toolkit. IEEE CS Press, 1994.
[7] R. Boichat, P. Dutta, S. Frolund, and R. Guerraoui, Deconstructing Paxos technical report, Distributed Programming Laboratory, EPFL, ID:200106, Jan. 2001.
[8] T.D. Chandra and S. Toueg, “Unreliable Failure Detectors for Reliable Distributed Systems,” J. ACM, vol. 43, no. 2, pp. 225–267, 1996.
[9] D. Dolev, R. Friedman, I. Keidar, and D. Malkhi, Failure Detectors in Omission Failure Environments Proc. 16th Ann. ACM Symp. Principles of Distributed Computing, p. 286, 1997.
[10] M.J. Fischer, N.A. Lynch, and M.S. Paterson, “Impossibility of Distributed Consensus with One Faulty Process,” J. ACM, vol. 32, no. 2, pp. 374i–382, 1985.
[11] U. FritzkeJr., Ph. Ingels, A. Mostefaoui, and M. Raynal, Consensus-Based Fault-Tolerant Total Order Multicast IEEE Trans. Parallel and Distributed Systems, vol. 12, no. 2, 147-156, 2001.
[12] D. Barbara, H. Garcia-Molina, and A. Spauster, “Increasing Availability Under Mutual Exclusion Constraints with Dynamic Vote Reassignment,” ACM Trans. Computer Systems, vol. 7, no. 4, pp. 394-426, Nov. 1989.
[13] D.K. Gifford, “Weighted Voting for Replicated Data,” Proc. Seventh ACM SIGOPS Symp. Operating Systems Principles, pp. 150-159, Dec. 1979.
[14] J. Gray and A. Reuter, Transaction Processing: Concepts and Techniques, Morgan Kauffman, 1993.
[15] R. Guerraoui and A. Schiper, The Generic Consensus Service IEEE Trans. Software Eng., vol. 27, no. 1, 29-41, Jan. 2001.
[16] R. Guerraoui and A. Schiper, Software-Based Replication for Fault Tolerance Computer, pp. 68-74, Apr. 1997.
[17] R. Guerraoui, M. Hurfin, A. Mostefaoui, R. Oliveira, M. Raynal, and A. Schiper, Consensus in Asynchronous Distributed Systems: A Concise Guided Tour Chapter 1: Recent Advances in Large Scale Distributed Systems, Springer-Verlag, LNCS 1752, pp. 33-47, 1999
[18] V. Hadzilacos and S. Toueg, "Fault-Tolerant Broadcasts and Related Problems," in Distributed Systems, S. Mullender, ed., ACM Press, New York, 1993, pp. 97-138.
[19] M. Hurfin, A. Mostefaoui, and M. Raynal, “Consensus in Asynchronous Systems Where Processes can Crash and Recover,” Proc. 17th IEEE Symp. Reliable Distributed Systems, pp. 280-286, 1998.
[20] B. Kemme, A. Bartoli, and Ö. Babaoglu, Online Reconfiguration in Replicated Databases Based on Group Communication Proc. Int'l Conf. Dependable Systems and Networks, June 2001.
[21] L. Lamport, The Part-Time Parliament ACM Trans. Computer Systems, vol. 16, no. 2, pp. 133-169, 1998.
[22] R. Ladin, B. Liskov, L. Shrira, and S. Ghemawat, "Providing High Availability Using Lazy Replication," ACM Trans. Computer Systems, vol. 10, no. 4, pp. 360-391, Nov. 1992.
[23] N. Lynch, Data Link Protocols, chapter 16, Distributed Algorithms, Morgan-Kaufmann Publishers. pp. 691-732, 1996.
[24] M. Mizuno, M. Raynal, and J. Zhou, Sequential Consistency in Distributed Systems Proc. Int'l Workshop Theory and Practice in Distributed Systems, pp. 224-241, 1994.
[25] R. Oliveira, R. Guerraoui, and A. Schiper, Consensus in the Crash-Recovery Model Research report 97-239, EPFL, Lausanne, Switzerland 1997.
[26] F. Pedone, R. Guerraoui, and A. Schiper, Exploiting Atomic Broadcast in Replicated Databases Proc. Europar Conf. pp. 513-520, 1998.
[27] D. Powell, “Group Communication,” Comm. ACM, vol. 39, no. 4, Apr. 1996.
[28] L. Rodrigues, R. Guerraoui, and A. Schiper, Scalable Atomic Multicast Proc. Seventh Int'l Conf. Computer Communications and Networks (IC3N '98), pp. 840-847, Oct. 1998.
[29] L. Rodrigues and M. Raynal, Atomic Broadcast in Asynchronous Crash-Recovery Distributed Systems Proc. 20th IEEE Int'l Conf. Distributed Computing Systems, pp. 288-295, Apr. 2000.
[30] L. Rodrigues and M. Raynal, Quorum-Based Replication in Asynchronous Crash-Recovery Distributed Systems Proc. Sixth European Conf. Parallel Computing (Euro-Par 2000), pp. 605-609, Aug. 2000.

Index Terms:
Distributed fault-tolerance, asynchronous systems, atomic broadcast, consensus, crash/recovery, quorum, replica management, weighted voting.
Citation:
Lu? Rodrigues, Michel Raynal, "Atomic Broadcast in Asynchronous Crash-Recovery Distributed Systems and Its Use in Quorum-Based Replication," IEEE Transactions on Knowledge and Data Engineering, vol. 15, no. 5, pp. 1206-1217, Sept.-Oct. 2003, doi:10.1109/TKDE.2003.1232273
Usage of this product signifies your acceptance of the Terms of Use.