Issue No. 01 - January (2004 vol. 15)
Francisco V. Brasileiro, IEEE Computer Society
<p><b>Abstract</b>—Replicated processing with majority voting is a well-known method for achieving reliability and availability. Triple Modular Redundant (TMR) processing is the most commonly used version of that method. Replicated processing requires that the replicas reach agreement on the order in which input requests are to be processed. Almost all synchronous and deterministic ordering protocols published in the literature are time-based in the sense that they require replicas' clocks to be kept synchronized within some known bound. We present a protocol for TMR systems that is based on timeouts and does not require clocks to be kept in bounded synchronism. Our design efforts focus on keeping the ordering delays small, without an unnecessary increase in message overhead. Consequently, we are able to show that no symmetric protocol that works only with unsynchronized clocks can provide a smaller worst-case delay. We also demonstrate through analysis and experiments that our protocol is faster than a time-based one of identical message complexity in certain situations which can prevail in many application settings.</p>
Byzantine failures, fault tolerance, Triple Modular Redundancy (TMR), process replication, agreement, message ordering, physical and logical clocks.
Paul D. Ezhilchelvan, Francisco V. Brasileiro, Neil A. Speirs, "A Timeout-Based Message Ordering Protocol for a Lightweight Software Implementation of TMR Systems", IEEE Transactions on Parallel & Distributed Systems, vol. 15, no. 1, pp. 53-65, January 2004, doi:10.1109/TPDS.2004.1264786