16th IEEE International Conference on Tools with Artificial Intelligence (ICTAI'04) An Empirical Evaluation of Interval Estimation for Markov Decision Processes Boca Raton, Florida November 15-November 17 ISBN: 0-7695-2236-X
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICTAI.2004.28
This paper takes an empirical approach to evaluating three model-based reinforcement-learning methods. All methods intend to speed the learning process by mixing exploitation of learned knowledge with exploration of possibly promising alternatives. We consider ε-greedy exploration, which is computationally cheap and popular, but unfocused in its exploration effort; R-Max exploration, a simplification of an exploration scheme that comes with a theoretical guarantee of efficiency; and a well-grounded approach, model-based interval estimation, that better integrates exploration and exploitation. Our experiments indicate that effective exploration can result in dramatic improvements in the observed rate of learning.
Citation:
Alexander L. Strehl, Michael L. Littman, "An Empirical Evaluation of Interval Estimation for Markov Decision Processes," ictai, pp.128-135, 16th IEEE International Conference on Tools with Artificial Intelligence (ICTAI'04), 2004 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||