In today?s open networking environment, the assumption that the learning agents that join a system are homogeneous is becoming increasingly unrealistic. This makes ef fective coordination particularly dif ficult to learn, especially in the absence of learning agent standards. In this short paper we investigate the problem of learning to coordinate with heterogeneous agents. We show that an agent employing the FMQ algorithm, a recently developed multiagent learning method, has the ability to converge towards the optimal joint action when teamed-up with one or more simple Q-learners. Specifically, we show such convergence in scenarios where simple Q-learners alone are unable to converge towards an optimum.
Citation:
Spiros Kapetanakis, Daniel Kudenko, "Reinforcement Learning of Coordination in Heterogeneous Cooperative Multi-Agent Systems," aamas, vol. 3, pp.1258-1259, Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 3 (AAMAS'04), 2004