Issue No.01 - Jan. (1986 vol.12)
C. S. Raghavendra , Department of Electrical Engineering - Systems, University of Southern California, Los Angeles, CA 90089
The reliability of distributed processing systems can be expressed in terms of the reliability of the processing elements that run the programs, the reliability of the processing elements holding the required files, and the reliability of the communication links used in file transfers. We introduce two reliability measures, namely, distributed program reliability and distributed system reliability to accurately model the reliability of distributed systems. The first measure describes the probability of successful execution of a distributed program which runs on some processing elements and needs to communicate with other processing elements for remote files, while the second measure describes the probability that all the programs of a given set can run successfully. The notion of minimal file spanning trees is introduced to efficiently evaluate these reliability measures. Graph theory techniques are used to systematically generate file spanning trees that provide all the required connections. Our technique is general and can be used in a dynamic environment for efficient reliability evaluation.
Computer network reliability, Software reliability, Distributed processing, Reliability theory, Indexes, Reliability engineering, spanning tree, Distributed program, distributed system, graph theory, reliability
C. S. Raghavendra, "Distributed program reliability analysis", IEEE Transactions on Software Engineering, vol.12, no. 1, pp. 42-50, Jan. 1986, doi:10.1109/TSE.1986.6312918