Issue No. 01 - February (1994 vol. 14)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/40.259902
<p>As the demand for highly parallel systems grows, the vast amount of concurrently operating hardware involved can make it difficult to guarantee proper system behavior. Problems arise both from permanent and transient hardware faults and from errors caused by improper programming. A number of fault tolerance solutions have emerged. Following a survey of fault tolerance in arrays, a discussion of solutions for more specialized architectures is presented.</p>
K. Grosspietsch, "Fault Tolerance in Highly Parallel Hardware Systems," in IEEE Micro, vol. 14, no. , pp. 60-68, 1994.