loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
16th IEEE International Conference on Tools with Artificial Intelligence (ICTAI'04)
An Empirical Evaluation of Interval Estimation for Markov Decision Processes
Boca Raton, Florida
November 15-November 17
ISBN: 0-7695-2236-X
Alexander L. Strehl, Rutgers University
Michael L. Littman, Rutgers University
This paper takes an empirical approach to evaluating three model-based reinforcement-learning methods. All methods intend to speed the learning process by mixing exploitation of learned knowledge with exploration of possibly promising alternatives. We consider ε-greedy exploration, which is computationally cheap and popular, but unfocused in its exploration effort; R-Max exploration, a simplification of an exploration scheme that comes with a theoretical guarantee of efficiency; and a well-grounded approach, model-based interval estimation, that better integrates exploration and exploitation. Our experiments indicate that effective exploration can result in dramatic improvements in the observed rate of learning.
Citation:
Alexander L. Strehl, Michael L. Littman, "An Empirical Evaluation of Interval Estimation for Markov Decision Processes," ictai, pp.128-135, 16th IEEE International Conference on Tools with Artificial Intelligence (ICTAI'04), 2004
Usage of this product signifies your acceptance of the Terms of Use.