Fourth International Conference on Multi-Agent Systems (ICMAS'00)
Evaluating Concurrent Reinforcement Learners
Boston, Massachusetts
July 10-July 12
ISBN: 0-7695-0625-9
Manisha Mundhe, University of Tulsa
Sandip Sen, University of Tulsa
The assumptions underlying the convergence proofs of reinforcement learning (RL) algorithms such as Q-learning are violated when multiple interacting agents adapt their strategies on-line as they learn. Empirical investigations in several domains, however, have produced encouraging results. We evaluate the convergence behavior of concurrent reinforcement learning agents using game matrices as studied by Claus and Boutilier [1]. Variants of simple RL algorithms are evaluated for convergence under an increasing number of agents per group, larger game matrices, delayed feedback, and varying game matrix characteristics. Our results show surprising departures from those observed by Claus and Boutilier, particularly for larger problem sizes.
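To make the setting concrete, the following is a minimal sketch of two concurrent learners in a repeated matrix game with a shared payoff. It is not the authors' algorithm: the abstract does not specify which RL variants were tested, so the stateless epsilon-greedy update, the function name `run_independent_learners`, the parameter values, and the 2x2 payoff matrix are all assumptions chosen for illustration. Each agent treats the other as part of the environment, which is exactly the situation in which the usual convergence assumptions no longer hold.

```python
import random


def run_independent_learners(payoff, episodes=5000, alpha=0.1, epsilon=0.1, seed=0):
    """Two independent learners repeatedly play a shared-payoff matrix game.

    payoff[a][b] is the common reward when the row agent plays a and the
    column agent plays b. All names and defaults here are hypothetical.
    """
    rng = random.Random(seed)
    q_row = [0.0] * len(payoff)      # row agent's value estimate per action
    q_col = [0.0] * len(payoff[0])   # column agent's value estimate per action

    def choose(q):
        # Epsilon-greedy selection with random tie-breaking among best actions.
        if rng.random() < epsilon:
            return rng.randrange(len(q))
        best = max(q)
        return rng.choice([a for a, v in enumerate(q) if v == best])

    for _ in range(episodes):
        a = choose(q_row)
        b = choose(q_col)
        reward = payoff[a][b]
        # Stateless Q-learning update: each agent sees only its own action
        # and the joint reward, never the other agent's choice.
        q_row[a] += alpha * (reward - q_row[a])
        q_col[b] += alpha * (reward - q_col[b])

    return q_row, q_col


# Hypothetical coordination game with one optimal and one suboptimal equilibrium.
payoff = [[10, 0],
          [0, 5]]
q_row, q_col = run_independent_learners(payoff)
print("row agent estimates:", q_row)
print("column agent estimates:", q_col)
```

Which joint action the agents settle on depends on the seed and on early exploration, so running the sketch over many seeds and larger matrices is one simple way to probe the kind of convergence behavior the abstract describes.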
Citation:
Manisha Mundhe, Sandip Sen, "Evaluating Concurrent Reinforcement Learners," Fourth International Conference on Multi-Agent Systems (ICMAS'00), p. 421, 2000