CSDL Home IEEE Transactions on Dependable and Secure Computing 2010 vol.7 Issue No.03 - July-September
Issue No.03 - July-September (2010 vol.7)
Tiejun Ma , Imperial College London, London
Jane Hillston , University of Edinburgh, Edinburgh
Stuart Anderson , University of Edinburgh, Edinburgh
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TDSC.2009.35
We model the probabilistic behavior of a system comprising a failure detector and a monitored crash-recovery target. We extend failure detectors to take account of failure recovery in the target system. This involves extending QoS measures to include the recovery detection speed and proportion of failures detected. We also extend estimating the parameters of the failure detector to achieve a required QoS to configuring the crash-recovery failure detector. We investigate the impact of the dependability of the monitored process on the QoS of our failure detector. Our analysis indicates that variation in the MTTF and MTTR of the monitored process can have a significant impact on the QoS of our failure detector. Our analysis is supported by simulations that validate our theoretical results.
Failure detectors, crash recovery, quality of service, availability, dependability, performance.
Tiejun Ma, Jane Hillston, Stuart Anderson, "On the Quality of Service of Crash-Recovery Failure Detectors", IEEE Transactions on Dependable and Secure Computing, vol.7, no. 3, pp. 271-283, July-September 2010, doi:10.1109/TDSC.2009.35