Issue No.07 - July (2006 vol.17)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TPDS.2006.95
<p><b>Abstract</b>—Leader-based protocols rest on a primitive able to provide the processes with the same unique leader. Such protocols are very common in distributed computing to solve synchronization or coordination problems. Unfortunately, providing such a primitive is far from being trivial in asynchronous distributed systems prone to process crashes. (It is even impossible in fault-prone purely asynchronous systems.) To circumvent this difficulty, several protocols have been proposed that build a leader facility on top of an asynchronous distributed system enriched with additional assumptions. The protocols proposed so far consider either additional assumptions based on synchrony or additional assumptions on the pattern of the messages that are exchanged. Considering systems with <tmath>n</tmath> processes and up to <tmath>f</tmath> process crashes, <tmath>1\leq f <n</tmath>, this paper investigates the combination of a time-free assumption on the message pattern with a synchrony assumption on process speed and message delay. It shows that both types of assumptions can be combined to obtain a hybrid eventual leader protocol benefiting from the best of both worlds. This combined assumption considers a star communication structure involving <tmath>f+1</tmath> processes. Its noteworthy feature lies in the level of combination of both types of assumption that is "as fine as possible” in the sense that each of the <tmath>f</tmath> channels of the star has to satisfy a property independently of the property satisfied by each of the <tmath>f-1</tmath> other channels (the <tmath>f</tmath> channels do not have to satisfy the same assumption). More precisely, this combined assumption is the following: There is a correct process <tmath>p</tmath> (center of the star) and a set <tmath>Q</tmath> of <tmath>f</tmath> processes <tmath>q</tmath> (<tmath>p \notin Q</tmath>) such that, eventually, either 1) each time it broadcasts a query, <tmath>q</tmath> receives a response from <tmath>p</tmath> among the <tmath>(n-f)</tmath> first responses to that query, or 2) the channel from <tmath>p</tmath> to <tmath>q</tmath> is timely. (The processes in the set <tmath>Q</tmath> can crash.) A surprisingly simple eventual leader protocol based on this fine grain hybrid assumption is proposed and proved correct. An improvement is also presented.</p>
Asynchronous system, distributed algorithm, fault tolerance, hybrid protocol, leader election, process crash, time-free assumption, timer-based assumption.
Michel Raynal, Achour Mostefaoui, "Time-Free and Timer-Based Assumptions Can Be Combined to Obtain Eventual Leadership", IEEE Transactions on Parallel & Distributed Systems, vol.17, no. 7, pp. 656-666, July 2006, doi:10.1109/TPDS.2006.95