Optimal and Efficient Probabilistic Distributed Diagnosis Schemes
July 1993 (vol. 42 no. 7)
pp. 882-886

This paper addresses the distributed self-diagnosis of a multiprocessor/multicomputer system, based on interprocessor tests with imperfect fault coverage, in a setting that permits intermittently faulty processors. Focusing on probabilistic diagnosis methods, the authors define several categories of probabilistic diagnosis according to the type of fault syndrome information used in the diagnosis. Rigorous probabilistic analysis is then used to derive diagnosis algorithms that are optimal in terms of diagnostic accuracy for each of these categories. The performance of these algorithms is evaluated through analysis and simulations.

Index Terms:
performance evaluation; probabilistic distributed diagnosis schemes; distributed self-diagnosis; multiprocessor; multicomputer system; interprocessor tests; imperfect fault coverage; intermittently faulty processors; probabilistic diagnosis methods; fault syndrome information; diagnosis categories; simulations; diagnosis algorithms; fault tolerant computing; multiprocessing systems.
S. Lee, K.G. Shin, "Optimal and Efficient Probabilistic Distributed Diagnosis Schemes," IEEE Transactions on Computers, vol. 42, no. 7, pp. 882-886, July 1993, doi:10.1109/12.237729