Eighth IEEE International Workshop on Object-Oriented Real-Time Dependable Systems (WORDS'03)
An Optimal Atomic Broadcast Protocol and an Implementation Framework
Guadalajara, Mexico
January 15-17, 2003
ISBN: 0-7695-1929-6
Paul EZHILCHELVAN, Dept of Computing Science, University of Newcastle, NE1 7RU, UK
Doug PALMER, Dept of Computing Science, University of Newcastle, NE1 7RU, UK
Michel RAYNAL, IRISA, Campus de Beaulieu, 35042 Rennes Cedex, France
Atomic Broadcast (where all processes deliver broadcast messages in the same order) is a very useful group communication primitive for building fault-tolerant distributed systems. This paper presents an atomic broadcast protocol that can be claimed to be optimal in terms of failure detection, resilience, and latency. The protocol requires only the weakest of the useful failure detectors for liveness, and permits up to (n-1)/2 processes to crash in a system of n processes; at most two communication steps and n broadcasts are needed in a run during which neither process crashes nor failure suspicions occur. We also introduce the notion of Notifying Broadcast, which can further reduce the message overhead in 'nice' runs, in which all processes are operational and communication delays do not exceed the assumed bound. If nice runs persist, the average message overhead is just one broadcast. That is, the protocol incurs no message overhead for providing crash tolerance as long as process failures and unanticipated fluctuations in communication delays do not occur. We are currently implementing our protocol as a CORBA component. All known ORBs use IIOP as the standard protocol for inter-process communication, which in turn uses TCP/IP as the common transport protocol. It turns out that Notifying Broadcast is straightforward to implement on top of the TCP transport layer.
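The resilience bound and the ordering property stated in the abstract can be illustrated with a small Python sketch. It is not the paper's protocol: the function names are hypothetical, "(n-1)/2" is read as rounded down for even n (an assumption), and the ordering check simply treats a crashed process as one whose delivery log is a prefix of the others.

```python
from typing import List, Sequence


def max_tolerated_crashes(n: int) -> int:
    """Largest integer f with f <= (n-1)/2 (assumed: rounded down for even n)."""
    if n < 1:
        raise ValueError("n must be a positive number of processes")
    return (n - 1) // 2


def majority(n: int) -> int:
    """Size of a strict majority among n processes."""
    return n // 2 + 1


def consistent_delivery(logs: Sequence[List[str]]) -> bool:
    """Check the atomic broadcast ordering property on recorded delivery logs.

    Every pair of logs must agree on their common prefix, so no two
    processes deliver the same messages in different orders. A shorter
    log models a process that crashed or has not yet caught up.
    """
    for a in logs:
        for b in logs:
            k = min(len(a), len(b))
            if a[:k] != b[:k]:
                return False
    return True


# With f crashes allowed, the surviving processes still form a majority.
for n in (3, 4, 5, 7):
    f = max_tolerated_crashes(n)
    assert n - f >= majority(n)
```

For example, `max_tolerated_crashes(5)` gives 2, so three of five processes remain, which is still a strict majority; `consistent_delivery([["m1", "m2"], ["m2", "m1"]])` is false because the two logs disagree on the order of the same messages.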
Index Terms:
Asynchronous Distributed System, Atomic Broadcast, Communication Step, Consensus, Crash Failure, Notifying Broadcast, Reliable Broadcast.
Citation:
Paul EZHILCHELVAN, Doug PALMER, Michel RAYNAL, "An Optimal Atomic Broadcast Protocol and an Implementation Framework," Eighth IEEE International Workshop on Object-Oriented Real-Time Dependable Systems (WORDS'03), p. 32, 2003