The Community for Technology Leaders
2013 IEEE 33rd International Conference on Distributed Computing Systems (2001)
Mesa, AZ
Apr. 16, 2001 to Apr. 19, 2001
ISBN: 0-7695-1077-9
pp: 0350
Christof Fetzer , AT&T Labs Research
ABSTRACT
Abstract: Perfect failure detectors can correctly decide whether a computer is crashed. However, it is impossible to implement a perfect failure detector in purely asynchronous systems. We show how to enforce perfect failure detection in timed distributed systems with hardware watchdogs. The two main system model assumptions are (1) each computer can measure time intervals with a known maximum error, and (2) each computer has a watchdog that crashes the computer unless the watchdog is periodically updated. We have implemented a system that satisfies both assumptions using a combination of off-the-shelf software and hardware.
INDEX TERMS
CITATION
Christof Fetzer, "Enforcing Perfect Failure Detection", 2013 IEEE 33rd International Conference on Distributed Computing Systems, vol. 00, no. , pp. 0350, 2001, doi:10.1109/ICDSC.2001.918965
97 ms
(Ver )