This Article 
 Bibliographic References 
 Add to: 
Modeling Correlation in Software Recovery Blocks
November 1993 (vol. 19 no. 11)
pp. 1071-1086

The authors examine the problem of accurately modeling the software fault-tolerance technique based on recovery blocks. Analysis of some systems have investigated the correlation between software modules, which may be due to a portion of the functional specification that is common to all software modules, or to the inherent hardness of some problems. Three types of dependence which can be captured using measurements are considered. These are correlation between software modules for a single input, correlation between successive acceptance tests on correct module outputs and incorrect module outputs, and correlation between subsequent inputs. The authors' technique is quite general and can be applied to other types of correlation. In accounting for dependence, they use the intensity distribution introduced by D.E. Eckhardt and L.D. Lee (1985). A method of generating the intensity distribution that is based on the pairwise correlation between modules is discussed. This method is contrasted with the assumption of independent modules as well as the use of the beta-binomial density introduced by V.F. Nicola and A. Goyai (1990). The effects of dependencies were studied using a Stochastic Reward Network (SRN) that incorporates all of the above dependencies and a modeling tool called Stochastic Petri Net Package (SPNP).

[1] A. Avizienis, "The N-version approach to fault-tolerant software,"IEEE Trans. Software Eng., vol. SE-11, pp. 1491-1501, Dec. 1985.
[2] B. Randell, "System structure for software fault tolerance,"IEEE Trans. Software Eng., vol. SE-1, pp. 220-232, June 1975.
[3] R. K. Scott, J. W. Gault, and D. F. McAllister, "The consensus recovery block," inProc. Total System Reliability Symp., 1983, pp. 74-85.
[4] J. Knight and N. Leveson, "An experimental evaluation of the assumption of independence in multiversion programming,"IEEE Trans. Software Eng., vol. SE-12, no. 1, pp. 96-109, Jan. 1986.
[5] J. Knight and N. Leveson, "An empirical study of failure probabilities in multiversion software," inProc. 16th Int. Symp. on Fault-Tolerant Computing, July 1986, pp. 165-170.
[6] S. S. Brilliant, J. C. Knight, and N. G. Leveson, "Analysis of faults in an N-version software experiment,"IEEE Trans. Software Eng., vol. 16, pp. 238-247, Feb. 1990.
[7] R. K. Scott, J. W. Gault and D. F. McAllisier, "Fault tolerant software reliability modeling,"IEEE Trans. Software Eng., vol. SE-13, pp. 582-592, May 1987.
[8] G. Pucci, "On the modelling and testing of recovery block structures," inProc. 20th Int. Symp. on Fault Tolerant Computing(Newcastle upon Tyne, UK), 1990, pp. 356-363.
[9] G. Pucci, "A new approach to the modeling of recovery block structures,"IEEE Trans. Software Eng., vol. 18, pp. 356-363, Feb. 1992.
[10] D. E. Eckhardt and L. D. Lee, "A theoretical basis for the analysis of multiversion software subject to coincident errors,"IEEE Trans. Software Eng., vol. SE-11, pp. 1511-1517, Dec. 1985.
[11] V. F. Nicola and A. Goyal, "Modeling of correlated failures and community error recovery in multiversion software,"IEEE Trans. Software Eng., vol. 16, pp. 350-359, Mar. 1990.
[12] R. Geist, A. Offutt, and F. Harris, "Estimation and enhancement of real-time software reliability through mutation analysis,"IEEE Trans. Comput., (Special Issue on Fault-Tolerant Computing), vol. 41, pp. 550-558, May 1992.
[13] P. E. Ammann and J. C. Knight, "Data diversity: An approach to software fault tolerance," inProc. 17th Int. Symp. on Fault-Tolerant Computing(Los Alamitos, CA) (IEEE Computer Society Press, July 1987), pp. 122-126.
[14] A. Csenki, "Recovery block reliability analysis with failure clustering," inProc. IFIP Working Group 10.4 Int. Working Conf. on Dependable Computing for Critical Applications(University of California, Santa Barbara), Aug. 1989.
[15] J. C. Laprie, "Dependability evaluation of software systems,"IEEE Trans. Software Eng., vol. SE-10, pp. 701-714, Nov. 1984.
[16] M. C. Hsueh, R. K. Iyer, and K. S. Trivedi, "Performability modeling based on real data: A case study,"IEEE Trans. Comput., vol. 37, pp. 478-484, Apr. 1988.
[17] H. Choi, V. G. Kulkarni, and K. S. Trivedi, "Markov regenerative stochastic Petri nets," in16th IFIP W.G. 7.3 Int. Symp. on Computer Performance Modelling, Measurement and Evaluation (Performance'93), (Rome, Italy), Sept. 1993.
[18] G. Chiola, "A software package for the analysis of Generalized Stochastic Petri Net models," inProc. Int. Workshop Timed Petri Nets, July 1985.
[19] G. Ciardo, A. Blakemore, P. F. Chimento, J. K. Muppala, and K. S. Trivedi, "Automated generation and analysis of markov reward models using stochastic reward nets," inLinear Algebra, Markov Chains, and Queueing Models, IMA Volumes in Mathematics and Applications, C. Meyer and R. J. Plemmons, Eds., vol. 48, Heidelberg, Germany: Springer-Verlag, 1993.
[20] J. Couvillion, R. Freire, R. Johnson, W. O. II, A. Qureshi, M. Rai, W. Sanders, and J. Tvedt, "Performability modelling with ultrasan,"IEEE Software, pp. 69-80, Sept. 1991.
[21] W. J. Stewart, "Marca: Markov chain analyzer," inNumerical Solution of Markov Chains. New York: Dekker, 1991, pp. 37-61.
[22] G. Ciardo, J. K. Muppala, and K. S. Trivedi, "Analyzing concurrent and fault-tolerant software using stochastic reward nets,"J. Parallel Distributed Comput., vol. 15, no. 3, pp. 255-269, 1992.
[23] J. K. Muppala, S. P. Woolet, and K. S. Trivedi, "Real-time systems performance in the presence of failures,"IEEE Computer, vol. 24, pp. 37-47, May 1991.
[24] J. L. Peterson,Petri Net Theory and the Modeling of Systems. Englewood Cliffs, NJ: Prentice-Hall, 1981.
[25] G. Ciardo, J. Muppala, and K. Trivedi, "SPNP: Stochastic Petri Net Package,"Proc. Third Int'l Workshop Petri Nets and Performance Models, CS Press, Los Alamitos, Calif., Order No. 2001, 1989, pp. 142-151.
[26] M. K. Molloy, "Performance analysis using stochastic Petri nets,"IEEE Trans. Comput., vol. C-31, pp. 913-917, Sept. 1982.
[27] M. Ajmone Marsan, G. Balbo, and G. Conte, "A class of generalized stochastic Petri nets for the performance evaluation of multiprocessor systems,"ACM Trans. Comput. Syst., vol. 2, pp. 93-122, May 1984.

Index Terms:
software recovery blocks; software fault-tolerance technique; recovery blocks; functional specification; successive acceptance tests; correct module outputs; pairwise correlation; beta-binomial density; Stochastic Reward Network; Stochastic Petri Net Package; SPNP; Markov models; software reliability; stochastic modeling; stochastic Petri nets; correlation; fault tolerant computing; Petri nets; software reliability; statistical analysis; system recovery
L.A. Tomek, J.K. Muppala, K.S. Trivedi, "Modeling Correlation in Software Recovery Blocks," IEEE Transactions on Software Engineering, vol. 19, no. 11, pp. 1071-1086, Nov. 1993, doi:10.1109/32.256854
Usage of this product signifies your acceptance of the Terms of Use.