Issue No. 05 - May (1992 vol. 41)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/12.142683
<p>Based on the measurements from two DEC VAX-cluster multicomputer systems, the issue of correlated failures is addressed. In particular, the characteristics of correlated failures, their impact and their modelling on dependability, are discussed. It is found from the data that most correlated failures are related to errors in shared resources and propagate from one machine to another. Comparisons between measurement-based models and analytical models that assume failure independence show that the impact of correlated failures on dependability is significant. Two validated models. the c-dependent model and the p-dependent model, are developed to evaluate the dependability of systems with correlated failures.</p>
correlated failures; multicomputer systems; DEC VAX-cluster; dependability; shared resources; c-dependent model; p-dependent model; computation theory; fault tolerant computing; multiprocessing systems.
R.K. Iyer, D. Tang, "Analysis and Modeling of Correlated Failures in Multicomputer Systems", IEEE Transactions on Computers, vol. 41, no. , pp. 567-577, May 1992, doi:10.1109/12.142683