On the Computational Aspects of Performability Models of Fault-Tolerant Computer Systems
June 1990 (vol. 39 no. 6)
pp. 832-836

It is shown that the (scaled) conditional moments of performability in Markov models are the states of a cascaded, linear, continuous-time dynamic system with identical system matrices in each stage. This interpretation leads to a simple method of computing the first moment for nonhomogeneous Markov models with finite mission time. In addition, the cascaded system representation leads to the derivation of a set of two stable algorithms for propagating the conditional moments of performability in homogeneous Markov models. In particular, a very fast doubling algorithm is derived that uses a diagonal Padé approximation and repeated squaring to compute the matrix exponential. The algorithms are shown to be superior to those based on eigenvalue analysis in terms of both computational efficiency and stability. The algorithms have obvious implications for solving reliability/availability models with large mission times.
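
The idea of computing a matrix exponential with a diagonal Padé approximation and repeated squaring can be illustrated with a short sketch. The code below is not the paper's algorithm; it is the textbook scaling-and-squaring scheme, applied to the first conditional moment of accumulated reward in a homogeneous Markov model, under the assumption that this moment satisfies dm/dt = Q m + r with m(0) = 0. The names `pade_expm`, `first_moment`, `Q` (generator matrix with zero row sums), `r` (reward-rate vector), and the Padé order `q` are all hypothetical choices made for illustration.

```python
import numpy as np

def pade_expm(A, q=6):
    """Approximate exp(A) with a diagonal (q, q) Pade approximant
    combined with scaling and repeated squaring (illustrative sketch)."""
    A = np.asarray(A, dtype=float)
    n = A.shape[0]
    norm = np.linalg.norm(A, np.inf)
    # Choose s so that the scaled matrix A / 2**s has infinity norm <= 1/2.
    s = max(0, int(np.ceil(np.log2(norm))) + 1) if norm > 0 else 0
    As = A / 2.0 ** s
    # Numerator N = sum_k c_k As^k, denominator D = sum_k c_k (-As)^k.
    c = 1.0
    X = np.eye(n)
    N = np.eye(n)
    D = np.eye(n)
    for k in range(1, q + 1):
        c = c * (q - k + 1) / (k * (2 * q - k + 1))
        X = As @ X
        N = N + c * X
        D = D + ((-1) ** k) * c * X
    E = np.linalg.solve(D, N)  # E approximates exp(As)
    # Undo the scaling: exp(A) = exp(As) ** (2**s), by s squarings.
    for _ in range(s):
        E = E @ E
    return E

def first_moment(Q, r, t):
    """First conditional moment of accumulated reward over [0, t],
    one entry per initial state (assumes dm/dt = Q m + r, m(0) = 0)."""
    Q = np.asarray(Q, dtype=float)
    r = np.asarray(r, dtype=float)
    n = Q.shape[0]
    # Augmented matrix [[Q, r], [0, 0]]: the top-right block of its
    # exponential equals the integral of exp(Q s) r over [0, t].
    M = np.zeros((n + 1, n + 1))
    M[:n, :n] = Q
    M[:n, n] = r
    return pade_expm(M * t)[:n, n]

# Two-state example: state 0 is up (reward rate 1) and fails at rate lam
# into absorbing state 1 (reward rate 0).
lam, t = 0.5, 10.0
Q = np.array([[-lam, lam], [0.0, 0.0]])
r = np.array([1.0, 0.0])
print(first_moment(Q, r, t))
```

For this two-state example the expected up time starting from state 0 has the closed form (1 - e^{-λt})/λ, which gives a quick check on the numerical result. Because the number of squarings grows only with the logarithm of the scaled norm, the cost increases slowly with mission time, which is consistent with the abstract's remark about large mission times.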

Index Terms:
scaled conditional moments; computational aspects; performability models; fault-tolerant computer systems; Markov models; continuous-time dynamic system; finite mission time; cascaded system representation; stable algorithms; doubling algorithm; diagonal Pade approximation; reliability; availability models; approximation theory; fault tolerant computing; Markov processes.
K.R. Pattipati, S.A. Shah, "On the Computational Aspects of Performability Models of Fault-Tolerant Computer Systems," IEEE Transactions on Computers, vol. 39, no. 6, pp. 832-836, June 1990, doi:10.1109/12.53605