12th Pacific Rim International Symposium on Dependable Computing (PRDC'06)
An OS-level Framework for Providing Application-Aware Reliability
Riverside, California
December 18-December 20
ISBN: 0-7695-2724-8
Long Wang, University of Illinois at Urbana-Champaign, IL
Weining Gu, University of Illinois at Urbana-Champaign, IL
The paper describes the reliability microkernel framework (RMK), a loadable kernel module for providing application-aware reliability and dynamically configuring reliability mechanisms installed in RMK. The RMK prototype is implemented in Linux and supports detection of application/OS failures and transparent application checkpointing. Experiment results show that the OS hang detection, which exploits characteristics of application and system behavior, can achieve high coverage (100% in our experiments) and low false positive rate. Moreover, the performance overhead is negligible because instruction counting is performed in hardware.
Citation:
Long Wang, Zbigniew Kalbarczyk, Weining Gu, Ravishankar K. Iyer, "An OS-level Framework for Providing Application-Aware Reliability," prdc, pp.55-62, 12th Pacific Rim International Symposium on Dependable Computing (PRDC'06), 2006