Issue No. 04 - October-December (2004 vol. 1)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TDSC.2004.40
Andrea Bondavalli , IEEE
This paper proposes a novel methodology and an architectural framework for handling multiple classes of faults (namely, hardware-induced software errors in the application, process and/or host crashes or hangs, and errors in the persistent system stable storage) in a COTS and Legacy-based application. The basic idea is to use an evidence-accruing fault tolerance manager to choose and carry out one of multiple fault recovery strategies, depending upon the perceived severity of the fault. The methodology and the framework have been applied to a case study system consisting of a Legacy system, which makes use of a COTS DBMS for persistent storage facilities. A thorough performability analysis has also been conducted via combined use of direct measurements and analytical modeling. Experimental results demonstrate that effective fault treatment, consisting of careful diagnosis and damage assessment, plays a key role in leveraging the dependability of COTS and Legacy-based applications.
Legacy systems and COTS components, fault diagnosis and treatment, fault injection, modeling and evaluation, performability.
L. Romano, D. Cotroneo, A. Bondavalli and S. Chiaradonna, "Effective Fault Treatment for Improving the Dependability of COTS and Legacy-Based Applications," in IEEE Transactions on Dependable and Secure Computing, vol. 1, no. , pp. 223-237, 2004.