Issue No. 01 - January (1990 vol. 39)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/12.46280
<p>A general procedure for error detection in complex systems, called the data block capture and analysis monitoring process, is described and analyzed. It is assumed that, in addition to being exposed to potential external fault sources, a complex system will in general always contain embedded hardware and software fault mechanisms which can cause the system to perform incorrect computations and/or produce incorrect output. Thus, in operation, the system continuously moves back and forth between error and no-error states. These external fault sources or internal fault mechanisms are extremely difficult to detect. The data block capture and analysis monitoring process is concerned with detecting deviations from the normal performance of the system, known as errors, which are symptomatic of fault conditions. The process consists of repeatedly recording a fixed amount of data from a set of predetermined observation lines of the system being monitored (i.e. capturing a block of data) and then analyzing the captured block in an attempt to determine whether the system is functioning correctly.</p>
generalized concurrent error detection procedure; data block capture; analysis monitoring process; external fault sources; fault mechanisms; error detection; fault tolerant computing; performance evaluation.
D. Blough and G. Masson, "Performance Analysis of a Generalized Concurrent Error Detection Procedure," in IEEE Transactions on Computers, vol. 39, no. , pp. 47-62, 1990.