Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 2 (AAMAS'04)
Best-Response Multiagent Learning in Non-Stationary Environments
New York City, New York, USA
July 19-July 23
ISBN: 0-7695-2092-8
This paper investigates a relatively new direction in Multiagent Reinforcement Learning. Most multiagent learning techniques focus on Nash equilibria as elements of both the learning algorithm and its evaluation criteria. In contrast, we propose a multiagent learning algorithm that is optimal in the sense of finding a best-response policy, rather than in reaching an equilibrium. We present the first learning algorithm that is provably optimal against restricted classes of non-stationary opponents. The algorithm infers an accurate model of the opponent?s non-stationary strategy, and simultaneously creates a best-response policy against that strategy. Our learning algorithm works within the very general framework of n-player, general-sum stochastic games, and learns both the game structure and its associated optimal policy.
Citation:
Michael Weinberg, Jeffrey S. Rosenschein, "Best-Response Multiagent Learning in Non-Stationary Environments," aamas, vol. 2, pp.506-513, Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 2 (AAMAS'04), 2004