Issue No.03 - March (1990 vol.16)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/32.48942
<p>Three aspects of the modeling of multiversion software are considered. First, the beta-binomial distribution is proposed for modeling correlated failures in multiversion software. Second, a combinatorial model for predicting the reliability of a multiversion software configuration is presented. This model can take as inputs failure distributions either from measurements or from a selected distribution (e.g. beta-binomial). Various recovery methods can be incorporated in this model. Third, the effectiveness of the community error recovery method based on checkpointing is investigated. This method appears to be effective only when the failure behaviors of program versions are lightly correlated. Two different types of checkpoint failure are also considered: an omission failure where the correct output is recognized at a checkpoint but the checkpoint fails to correct the wrong outputs and a destructive failure where the good versions get corrupted at a checkpoint.</p>
correlated failures; community error recovery; multiversion software; beta-binomial distribution; combinatorial model; software configuration; failure distributions; selected distribution; recovery methods; checkpointing; failure behaviors; lightly correlated; checkpoint failure; omission failure; destructive failure; fault tolerant computing; software reliability; system recovery.
V.F. Nicola, A. Goyal, "Modeling of Correlated Failures and Community Error Recovery in Multiversion Software", IEEE Transactions on Software Engineering, vol.16, no. 3, pp. 350-359, March 1990, doi:10.1109/32.48942