loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 3 (AAMAS'04)
New York City, New York, USA
July 19-July 23
ISBN: 0-7695-2092-8
Stefan Bieniawski, Stanford University
David H. Wolpert, NASA Ames Research Center
Product Distribution (PD) theory was recently developed as a framework for analyzing and optimizing distributed systems. In this paper we demonstrate its use for adaptive distributed control of Multi-Agent Systems (MAS?s), i.e., for distributed stochastic optimization using MAS?s. One common way to perform the optimization is to have each agent run a Reinforcement Learning (RL) algorithm. PD theory provides an alternative based upon using a variant of Newton?s method operating on the agent?s probability distributions. We compare this alternative to RL-based search in three sets of computer experiments. The PD-theory-based approach outperforms the RL-based scheme in all three domains.
Citation:
Stefan Bieniawski, David H. Wolpert, "Adaptive, Distributed Control of Constrained Multi-Agent Systems," aamas, vol. 3, pp.1230-1231, Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 3 (AAMAS'04), 2004
Usage of this product signifies your acceptance of the Terms of Use.