Issue No.06 - June (1997 vol.8)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/71.595574
<p><b>Abstract</b>—We present an efficient approach to characterizing the fault tolerance of multiprocessor systems that employ multiple shared buses for interprocessor communication. Of concern is connective fault tolerance, which is defined as the ability to maintain communication between any two fault-free processors in the presence of faulty processors, buses, or processor-bus links. We introduce a model called processor-bus-link (PBL) graphs to represent a multiple-bus system's interconnection structure. The model is more general than previously proposed models, and has the advantages of simple representation, broad application, and the ability to model partial bus failures. The PBL graph implies a set of component adjacency graphs that highlights various connectivity features of the system. Using these graphs, we propose a method for analyzing the maximum number of faults a multiple-bus system can tolerate, and for identifying every minimum set of faulty components that disconnects the processors of the system. We also analyze the connective fault tolerance of several proposed multiple-bus systems to illustrate the application of our method.</p>
Multiple-bus systems, multiprocessors, fault tolerance, graph models, interconnection methods.
Hung-Kuei Ku, John P. Hayes, "Connective Fault Tolerance in Multiple-Bus Systems", IEEE Transactions on Parallel & Distributed Systems, vol.8, no. 6, pp. 574-586, June 1997, doi:10.1109/71.595574