10th Pacific Rim International Symposium on Dependable Computing (PRDC'04)
A Dynamic Checkpointing Scheme Based on Reinforcement Learning
Papeete, Tahiti, French Polynesia
March 03-March 05
ISBN: 0-7695-2076-6
In this paper, we develop a new checkpointing scheme for a uniprocess application. First, we model the checkpointing scheme by a semi-Markov decision process, and apply the reinforcement learning algorithm to estimate statistically the optimal checkpointing policy. More specifically, the representative reinforcement learning algorithm, called the Q-learning algorithm, is used to develop an adaptive checkpointing scheme. In simulation experiments, we examine the asymptotic behavior of the system overhead with adaptive checkpointing and show quantitatively that the proposed dynamic checkpoint algorithm is useful and robust under an incomplete knowledge on the failure time distribution.
Index Terms:
Dynamic checkpointing, Uniprocess application, Semi-Markov decision process, Reinforcement learning, Q-learning
Citation:
Hiroyuki Okamura, Yuki Nishimura, Tadashi Dohi, "A Dynamic Checkpointing Scheme Based on Reinforcement Learning," prdc, pp.151-158, 10th Pacific Rim International Symposium on Dependable Computing (PRDC'04), 2004