Communication Adaptive Self-Stabilizing Group Membership Service
July 2003 (vol. 14 no. 7)
pp. 709-720

Abstract—This paper presents the first (randomized) algorithm for implementing self-stabilizing group communication services in an asynchronous system. Our algorithm converges rapidly to legal behavior and is communication adaptive, namely, the communication volume is high when the system recovers from the occurrence of faults and is low once a legal state is reached. Communication adaptability is achieved by a new technique that combines transient fault detectors.
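
To make "communication adaptive" concrete, the toy sketch below shows message volume that is high only while a detector reports an illegal state. It is not the paper's algorithm. The names `Node`, `detector_agrees`, and `run_round` are hypothetical, the detector is a global check standing in for a distributed transient fault detector, and the message counts (all-to-all during recovery, one heartbeat per node afterwards) are assumptions chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass
class Node:
    node_id: int
    view: frozenset  # membership view this node currently believes in


def detector_agrees(nodes):
    """Toy stand-in for a transient fault detector (assumption):
    reports a legal state when every node holds the same view."""
    return len({n.view for n in nodes}) == 1


def run_round(nodes):
    """Run one round and return the number of messages sent in it."""
    if detector_agrees(nodes):
        # Legal state: each node sends a single heartbeat (low volume).
        return len(nodes)
    # Illegal state: all-to-all exchange, after which every node
    # adopts the set of all participating ids (high volume).
    live = frozenset(n.node_id for n in nodes)
    for n in nodes:
        n.view = live
    return len(nodes) * (len(nodes) - 1)
```

Starting four nodes with arbitrary (here, singleton) views, the first call to `run_round` sends 12 messages and repairs the views, and every later call sends 4.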

Index Terms:
Self-stabilization, fault tolerance, group communication, middleware.
Citation:
Shlomi Dolev, Elad Schiller, "Communication Adaptive Self-Stabilizing Group Membership Service," IEEE Transactions on Parallel and Distributed Systems, vol. 14, no. 7, pp. 709-720, July 2003, doi:10.1109/TPDS.2003.1214322