2005 IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT'05)
Self-Organizing Cognitive Agents and Reinforcement Learning in Multi-Agent Environment
Compi?gne University of Technology, France
September 19-September 22
ISBN: 0-7695-2416-8
DOI Bookmark:
http://doi.ieeecomputersociety.org/10.1109/IAT.2005.125
This paper presents a self-organizing cognitive architecture, known as TD-FALCON, that learns to function through its interaction with the environment. TD-FALCON learns the value functions of the state-action space estimated through a temporal difference (TD) method. The learned value functions are then used to determine the optimal actions based on an action selection policy. We present a specific instance of TD-FALCON based on an e-greedy action policy and a Q-learning value estimation formula. Experiments based on a minefield navigation task and a minefield pursuit task show that TD-FALCON systems are able to adapt and function well in a multi-agent environment without an explicit mechanism for collaboration.
Citation:
Ah-Hwee Tan, Dan Xiao, "Self-Organizing Cognitive Agents and Reinforcement Learning in Multi-Agent Environment," iat, pp.351-357, 2005 IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT'05), 2005
Usage of this product signifies your acceptance of the
Terms of Use.
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||