This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
27th International Symposium on Fault-Tolerant Computing (FTCS '97)
Experimental Evaluation of Failure-Detection Schemes in Real-time Communication Networks
Seattle, WA
June 25-June 27
ISBN: 0-8186-7831-3
Seungjae Han, Real-Time Computing Laboratory The University of Michigan
Kang G. Shin, Real-Time Computing Laboratory The University of Michigan
An effective failure-detection scheme is essential for reliable communication services. Most computer networks rely on behavior-based detection schemes: each node uses heartbeats to detect the failure of its neighbor nodes, and the transport protocol (like TCP) achieves reliable communication by acknowledgment/retransmission. In this paper, we experimentally evaluate the effectiveness of such behavior-based detection schemes in real-time communication. Specifically, we measure and analyze the coverage and latency of two failure-detection schemes --- neighbor detection and end-to-end detection --- through fault-injection experiments. The experimental results have shown that a significant portion of failures can be detected very quickly by the neighbor detection scheme, while the end-to-end detection scheme uncovers the remaining failures with larger detection latencies.
Index Terms:
Real-time communication, network failures, failure detection, fault-injection experiments.
Citation:
Seungjae Han, Kang G. Shin, "Experimental Evaluation of Failure-Detection Schemes in Real-time Communication Networks," ftcs, pp.122, 27th International Symposium on Fault-Tolerant Computing (FTCS '97), 1997
Usage of this product signifies your acceptance of the Terms of Use.