Issue No. 02 - March/April (2005 vol. 9)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/MIC.2005.31
Zbigniew Kalbarczyk , University of Illinois, Urbana-Champaign
Ravishankar K. Iyer , University of Illinois, Urbana-Champaign
Long Wang , University of Illinois, Urbana-Champaign
Many current approaches to software-implemented fault tolerance (SIFT) rely on process replication, which is often prohibitively expensive for practical use due to its high performance overhead and cost. The Adaptive Reconfigurable Mobile Objects of Reliability (Armor) middleware architecture offers a scalable low-overhead way to provide high-dependability services to applications. It uses coordinated multithreaded processes to manage redundant resources acrossinterconnected nodes, detect errors in user applications and infrastructural components, and provide failure recovery. The authors describe their experiences and lessons learned in deploying Armor in several diverse fields.
software-implemented fault tolerance, high-dependability services, failure recovery, middleware
L. Wang, R. K. Iyer and Z. Kalbarczyk, "Application Fault Tolerance with Armor Middleware," in IEEE Internet Computing, vol. 9, no. , pp. 28-37, 2005.