22nd International Symposium on Reliable Distributed Systems (SRDS'03)
Group Communication Protocols under Errors
Florence, Italy
October 6-8, 2003
ISBN: 0-7695-1955-5
Claudio Basile, University of Illinois at Urbana-Champaign
Long Wang, University of Illinois at Urbana-Champaign
Zbigniew Kalbarczyk, University of Illinois at Urbana-Champaign
Ravi Iyer, University of Illinois at Urbana-Champaign

Group communication protocols constitute a basic building block for highly dependable distributed applications. Designing and correctly implementing a group communication system (GCS) is a difficult task. While many theoretical algorithms have been formalized and proved correct, only a few research projects have experimentally assessed the dependability of GCS implementations under complex error scenarios.

This paper describes a thorough error-injection experimental campaign conducted on Ensemble, a popular GCS. By employing synthetic benchmark applications, we stress selected components of the GCS — the group membership service, the FIFO-ordered reliable multicast, and the sequencer-based, total-ordered reliable multicast — under various error models, including errors in the memory (text and heap segments) and in the network messages.

The data show that about 5-6% of the failures are due to an error escaping Ensemble's error-containment mechanism and manifesting as a fail-silence violation. This constitutes an impediment to achieving high dependability, the natural objective of GCSs. Our results are derived for a particular system (Ensemble), and more investigation involving other GCSs is required to generalize the conclusions. Nevertheless, through an accurate analysis of the failure causes and the error propagation patterns, this paper offers insights into the design and the implementation of robust GCSs.

Citation:
Claudio Basile, Long Wang, Zbigniew Kalbarczyk, Ravi Iyer, "Group Communication Protocols under Errors," 22nd International Symposium on Reliable Distributed Systems (SRDS'03), p. 35, 2003