Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 3 (AAMAS'04), New York City, New York, USA, July 19-23, 2004. ISBN: 0-7695-2092-8
We present a distributed variant of Q-learning that learns the optimal cost-to-go function in stochastic cooperative multi-agent domains without communication between the agents.
Citation:
Martin Lauer, Martin Riedmiller, "Reinforcement Learning for Stochastic Cooperative Multi-Agent Systems," aamas, vol. 3, pp. 1516-1517, Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 3 (AAMAS'04), 2004.