• Publication
  • 1991
  • Issue No. 4 - April
  • Abstract - The KAT (Knowledge-Action-Transformation) Approach to the Modeling and Evaluation of Reliability and Availability Growth
 This Article 
 Bibliographic References 
 Add to: 
The KAT (Knowledge-Action-Transformation) Approach to the Modeling and Evaluation of Reliability and Availability Growth
April 1991 (vol. 17 no. 4)
pp. 370-382

An approach for the modeling and evaluation of reliability and availability of systems using the knowledge of the reliability growth of their components is presented. Detailed models of reliability and availability for single-component systems are derived under much weaker assumption than usually considered. These models, termed knowledge models, enable phenomena to be precisely characterized, and a number of properties to be deduced. Since the knowledge models are too complex to be applied in real life for performing predictions, simplified models for practical purposes (action models) are discussed. The hyperexponential model is presented and applied to field data of software and hardware failures. This model is shown to be comparable to other models as far as reliability of single-component systems is concerned: in addition, it enables estimating and predicting the reliability of multicomponent systems, as well as their availability. The transformation approach enables classical Markov models to be transformed into other Markov models which account for reliability growth. The application of the transformation to multicomponent systems is described.

[1] N. Adams, "Optimizing preventive service of software products,IBM J. Res. Develop., vol. 28, no. 1, pp. 2-14, Jan. 1984.
[2] A. A. Abdel-Ghaly, P. Y. Chan, and B. Littlewood, "Evaluation of competing software reliability predictions,"IEEE Trans. Software Eng., vol. SE-12, no. 9, Sept. 1986.
[3] V. Amoia, G. De Micheli, and M. Santomauro, "Computer-oriented formulation of transition-rate matrices via Kronecker algebra,"IEEE Trans. Rel., vol. R-30, no. 2, June 1981.
[4] J. Arlat, K. Kanoun, and J. C. Laprie, "Dependability evaluation of software fault-tolerance," inProc. 18th IEEE Int. Symp. Fault Tolerant Computing (FTCS-18), Tokyo, Japan, June 1988, pp. 142-147.
[5] R. L. Aveyard and F. T. Man, "A study on the reliability of the circuit maintenance system 1-B,"Bell Syst. Tech. J., vol. 59, pp. 1317-1332, Oct. 1980.
[6] R. E. Barlow and F. Proschan,Statistical Theory of Reliability and Life Testing. New York: Holt, 1975.
[7] A. Birolini, "Some applications of regenerative stochastic processes to reliability theory, Part one: Tutorial introduction,"IEEE Trans. Rel., vol. R-23, no. 3, pp. 186-194, Aug. 1974.
[8] H. A. Bauer, L. M. Croxall, and E. A. Davis, "The 5ESS switching system: System test, first-office application, and early field experience, "AT&T Tech. J., vol. 64, no. 6, pp. 1503-1522, 1985.
[9] C. Beounes and J. C. Laprie, "Dependability evaluation of complex computer systems: Stochastic Petri net modeling," inProc. 15th IEEE Int. Symp. Fault-Tolerant Computing, Ann Arbor, MI, June 1985, pp. 364-369.
[10] B. Beyaert, G. Florin, P. Lonc, and S. Natkin, "Evaluation of computer systems dependability using stochastic Petri nets," inProc. 11th IEEE Int. Symp. Fault-Tolerant Computing, Portland, ME, June 1981, pp. 79-81.
[11] A. Costes, C. Landrault, and J. C. Laprie, "Reliability and availability models for maintained systems featuring hardware failures and design faults,"IEEE Trans. Comput., vol. C-27, pp. 548-560, June 1978.
[12] A. Costes, J. E. Doucet, C. Landrault, and J. C. Laprie, "SURF: A program for dependability evaluation of complex fault-tolerant computing systems," inProc. 11th IEEE Int. Symp. Fault-Tolerant Computing, Portland, ME, June 1981, pp. 72-78.
[13] D. R. Cox,Renewal Theory. London: Methuen, 1962.
[14] D. R. Cox and H. D. Miller,The Theory of Stochastic Processes. London: Methuen, 1968.
[15] P. A. Currit, M. Dyer, and H. D. Mills, "Certifying the reliability of software,"IEEE Trans. Software Eng., vol. SE-12, no. 1, pp. 3-11, Jan. 1986.
[16] J. T. Duane, "Learning curve approach to reliability monitoring,"IEEE Trans. Aerospace, vol. 2, pp. 563-566, 1964.
[17] J. M. Finkelstein, "Starting and limiting values for reliability growth,"IEEE Trans. Rel., vol. R-28, no. 2, pp. 111-113, June 1979.
[18] B. V. Gnedenko, Y. K. Belyayev, and A. D. Solovyev,Mathematical Methods of Reliability Theory. New York: Academic, 1969.
[19] A. L. Goel and K. Okumoto, "Time dependent error detection rate model for software and other performance measures,"IEEE Trans. Rel., vol. R-28, pp. 206-211, Aug. 1979.
[20] A. Goyal, W. C. Carter, E. de Souza e Silva, and S. S. Lavenberg," "The system availability estimator, "inProc. 16th IEEE Int. Symp. Fault-Tolerant Computing, Vienna, Austria, June 1986, pp. 84-89.
[21] D. Gross and D. R. Miller, "The randomization technique as a modeling tool and solution procedure for transient Markov processes",Oper. Res., vol. 32, no. 2, pp. 343-361, 1984.
[22] H. Hecht, "Fault-tolerant software,"IEEE Trans. Rel., vol. R-28, pp. 227-232, Aug. 1979.
[23] Z. Jelinski and B. P. Moranda, "Software reliability research," inStatistical Methods for the Evaluation of Computer System Performance. New York: Academic, 1972, pp. 465-484.
[24] M. Kaaniche, K. Kanoun, and S. Metge, "Role of the hyperexponential model in the software validation process of a telecommunication equipment," inProc. 7th Int. Conf. Reliability and Maintainability, Brest, France, June 1990, pp. 332-339 (in French).
[25] K. Kanoun and J. C. Laprie, "Modeling software reliability and availability from development validation up to operation," LAAS Res. Rep. 85.042, Aug. 1985.
[26] K. Kanoun and T. Sabourin, "Software dependability of a telephone switching system," inProc. 17th IEEE Int. Symp. Fault Tolerant Computing (FTCS-17), Pittsburgh, PA, June 1987, pp. 236-241.
[27] K. Kanoun, J. C. Laprie, and T. Sabourin, "A method for software reliability growth analysis and assessment," inProc. 1st Int. Workshop Software Engineering and Its Applications, Toulouse, France, Dec. 1988, pp. 859-878.
[28] K. Kanoun, "Software dependability growth Characterization, modeling, evaluation," Docteurès-Sciences thesis, Toulouse Polytechnic National Institute, LAAS Rep. 89.320, Sept. 1989 (in French).
[29] K. Kanoun, M. Bastos Martini, and J. Moreira De Souza, "A method for software reliability analysis and prediction application to the TROPICO-R switching system," this issue, pp. 334-344.
[30] P. A. Keiller, B. Littlewood, D. R. Miller, and A. Sofer, "Comparison of software reliability predictions," inProc. 13th IEEE Int. Symp. Fault-Tolerant Computing, Milano, Italy, June 1983, pp. 128-134.
[31] J. G. Kemeny and J. L. Snell,Finite Markov Chains. Princeton, NJ: Van Nostrand, 1959.
[32] J. C. Laprie, "Dependability modeling and evaluation of hardware-and-software systems," inProc. 2nd GI/NTG/GMR Conf. Fault Tolerant Computing, Bonn, Germany, Sept. 1984, pp. 202-215.
[33] J. C. Laprie, "Dependability evaluation of software systems in operation,"IEEE Trans. Software Eng., vol. SE-10, no. 6, pp. 701-714, Nov. 1984.
[34] J. C. Laprie, "Towards an X-ware reliability theory,"Technique et Science Informatiques, vol. 7, no. 3, pp. 315-330, 1988 (in French); Available in English as LAAS Rep. 86.376, Dec. 1986.
[35] J. C. Laprie, "Hardware-and-software dependability evaluation," inProc. IFIP 11th World Computer Congress, San Francisco, CA, Aug. 1989, pp. 109-114.
[36] J. C. Laprie, C. Beounes, M. Kaaniche, and K. Kanoun, "The transformation approach to modeling and evaluation of the reliability and availability growth of systems," inProc. 20th IEEE Int. Symp. Fault Tolerant Computing (FTCS-20), Newcastle, England, June 1990, pp. 364-371.
[37] Y. Levendel, "Defects and reliability analysis of large software systems: Field experience," inProc. 19th IEEE Int. Symp. Fault Tolerant Computing (FTCS-19), Chicago, IL, June 1989, pp. 238-244.
[38] P. A. Lewis, "A branching Poisson process model for the analysis of computer failure patterns,"J. Roy. Statist. Soc. B, vol. 26, no. 3, pp. 398-456, 1964.
[39] B. Littlewood and J. L. Verral, "A Bayesian Reliability Growth Model for Computer Software,"J. Roy. Statist. Soc. C (Appl. Statist.), vol. 22, pp. 332-336, 1973.
[40] B. Littlewood, "Software reliability model for modular program structure,"IEEE Trans. Rel., vol. R-28, no. 3, pp. 241-246, Aug. 1979.
[41] B. Littlewood, "Theories of software reliability: How good are they and how can they be improved?"IEEE Trans. Software Eng., vol. SE-6, no. 5, pp. 489-500, Sept. 1980.
[42] B. Littlewood, "Stochastic reliability growth: A model for fault-removal in computer programs and hardware designs,"IEEE Trans. Rel., vol. R-30, no. 4, pp. 313-320, Oct. 1981.
[43] M. Ajmone Marsan, G. Balbo, and G. Conte, "A class of generalized stochastic Petri nets for the performance evaluation of multiprocessor systems,"ACM Trans. Comput. Syst., vol. 2, pp. 93-122, May 1984.
[44] S. Metge, "Analysis and evaluation of the reliability of two ESS software systems," Eng. thesis, CNAM, Rep. LAAS 90.112 May 1990 (in French).
[45] D. R. Miller, "Exponential order statistic models of software reliability growth,"IEEE Trans. Software Eng., vol. SE-12, no. 1, pp. 12-24, Jan. 1986.
[46] M. Molloy, "Performance analysis using stochastic Petri nets,"IEEE Trans. Comput., vol 39, no. 9, pp. 913-917, Sept. 1982.
[47] J. D. Musa, "Software reliability data," Data and Analysis Centre for Software, Rome Air Development Center (RADC), Rome, NY, Tech. Rep., 1979.
[48] J. D. Musa and K. Okumoto, "A logarithmic Poisson execution time model for software reliability measurement," inProc. Compsac' 84, Chicago, IL, 1984, pp. 230-238.
[49] P. M. Nagel and J. A. Skrivan, "Software reliability: repetitive run experimentation and modeling," NASA Rep. CR-165 836, Feb. 1982.
[50] M. Ohba, "Software reliability analysis models,"IBM J. Res. Develop., vol. 21, no. 4, pp. 428-443, July 1984.
[51] P. I. Pignal, "An analysis of hardware and software availability exemplified on the IBM 3725 communication controller,"IBM J. Res. Develop., vol. 32, no. 2, pp. 268-278, Mar. 1988.
[52] A. Reibman, R. Smith, and K. Trivedi, "Markov and Markov reward model transient analysis: An overview of numerical approaches,"European J. Oper. Res., vol. 40, pp. 257-267, 1989.
[53] W. B. Rohn and T. F. Arnold, "Design for low expected downtime control systems," inProc. 4th Int. Conf. Computer Communications, Philadelphia, PA, June 1972, pp. 16-25.
[54] G. E. Stark, "Dependability evaluation of integrated hardware/ software systems,"IEEE Trans. Rel., vol. R-36, no. 4, pp. 440-444, Oct. 1987.
[55] W. J. Stewart and A. Goyal, "Matrix methods in large dependability models," IBM Res. Rep. RC-11485, Yorktown Heights, NY, Nov. 1985.
[56] Y. Tohma, K. Tokunaga, S. Nagase, and Y. Murata, "Structural approach to the estimation of the number of residual faults based on the hyper-geometric distribution,"IEEE Trans. Software Eng., vol. 15, no. 3, pp. 345-355, Mar. 1989.
[57] K. S. Trivedi, "Reliability evaluation for fault-tolerant systems," inMathematical Computer Performance and Reliability, G. Iazeolla, P. J. Courtois, and A. Hordijk, Eds. Amsterdam, The Netherlands: North-Holland, 1984, pp. 403-414.
[58] J. J. Wallace and W. W. Barnes, "Designing for ultrahigh availability: The Unix RTR operating system,"Computer, pp. 31-39, Aug. 1984.
[59] S. Yamada and S. Osaki, "Reliability growth modeling for software error detection,"IEEE Trans. Rel., vol. R-32, no. 5, pp. 475-478, 1983.
[60] S. Yamada and S. Osaki, "Software reliability growth modeling: Models and assumptions,"IEEE Trans. Software Eng., vol. SE-11, no. 12, pp. 1431-1437, Dec. 1985.

Index Terms:
KAT; knowledge-action-transformation; availability growth; single-component systems; knowledge models; action models; hyperexponential model; multicomponent systems; Markov models; knowledge based systems; Markov processes; reliability theory; software reliability
J.-C. Laprie, K. Kanoun, C. Beounes, M. Kaaniche, "The KAT (Knowledge-Action-Transformation) Approach to the Modeling and Evaluation of Reliability and Availability Growth," IEEE Transactions on Software Engineering, vol. 17, no. 4, pp. 370-382, April 1991, doi:10.1109/32.90436
Usage of this product signifies your acceptance of the Terms of Use.