<p><b>Abstract</b>—This paper presents a model for characterizing the diagnosability of algorithm-based fault-tolerant (ABFT) systems. In the model, the relationships among the processors computing useful data, the output data, and the check processors are defined in terms of matrix entries. Necessary and sufficient conditions for detecting and locating faults in the processors are derived and, based on these conditions, efficient algorithms are developed to evaluate the system's fault detection and location capabilities.</p>
<p><b>Index Terms</b>—Multiprocessor systems, faults and errors, system-level diagnosis, algorithm-based fault tolerance, modeling, and analysis.</p>
