On Self-Fault Diagnosis of the Distributed Systems
February 1988 (vol. 37 no. 2)
pp. 248-251
The problem of achieving fault diagnosis in a network of interconnected processing elements (called nodes) is considered. It is assumed that there is no central facility to control, coordinate, or mediate among the processing elements. Every node can eventually determine the status of nodes and communication paths between them. A diagnostic algorithm for homogeneous systems (systems with only te

Index Terms:
self-testing; self-fault diagnosis; distributed systems; network; interconnected processing elements; diagnostic algorithm; homogeneous systems; inhomogeneous systems; automatic testing; computer testing; distributed processing; fault location.
S.H. Hosseini, J.G. Kuhl, S.M. Reddy, "On Self-Fault Diagnosis of the Distributed Systems," IEEE Transactions on Computers, vol. 37, no. 2, pp. 248-251, Feb. 1988, doi:10.1109/12.2158