16th IEEE International Conference on Tools with Artificial Intelligence (ICTAI'04)
An Empirical Evaluation of Interval Estimation for Markov Decision Processes
Boca Raton, Florida
November 15-November 17
ISBN: 0-7695-2236-X
This paper takes an empirical approach to evaluating three model-based reinforcement-learning methods. All methods intend to speed the learning process by mixing exploitation of learned knowledge with exploration of possibly promising alternatives. We consider ε-greedy exploration, which is computationally cheap and popular, but unfocused in its exploration effort; R-Max exploration, a simplification of an exploration scheme that comes with a theoretical guarantee of efficiency; and a well-grounded approach, model-based interval estimation, that better integrates exploration and exploitation. Our experiments indicate that effective exploration can result in dramatic improvements in the observed rate of learning.
Citation:
Alexander L. Strehl, Michael L. Littman, "An Empirical Evaluation of Interval Estimation for Markov Decision Processes," ictai, pp.128-135, 16th IEEE International Conference on Tools with Artificial Intelligence (ICTAI'04), 2004