Issue No.09 - September (2009 vol.58)
Srinivasan M. Iyer , Exalead, Inc., San Francisco
Alexandros V. Gerbessiotis , New Jersey Institute of Technology, Newark
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TC.2009.31
We develop a continuous-time Markov chain model of a dependability system operating in a randomly changing environment and subject to probabilistic cascading failures. A cascading failure can be thought of as a rooted tree. The root is the component whose failure triggers the cascade, its children are those components that the root's failure immediately caused, the next generation are those components whose failures were immediately caused by the failures of the root's children, and so on. The amount of cascading is unlimited. We consider probabilistic cascading in the sense that the failure of a component of type i causes a component of type j to fail simultaneously with a given probability, with all failures in a cascade being mutually independent. Computing the infinitesimal generator matrix of the Markov chain poses significant challenges because of the exponential growth in the number of trees one needs to consider as the number of components failing in the cascade increases. We provide a recursive algorithm generating all possible trees corresponding to a given transition, along with an experimental study of an implementation of the algorithm on two examples. The numerical results highlight the effects of cascading on the dependability of the models.
Availability, reliability modeling, Markov processes, trees, cascading failures.
Srinivasan M. Iyer, Alexandros V. Gerbessiotis, "A Markovian Dependability Model with Cascading Failures", IEEE Transactions on Computers, vol.58, no. 9, pp. 1238-1249, September 2009, doi:10.1109/TC.2009.31