2003 IEEE/WIC International Conference on Intelligent Agent Technology (IAT'03)
Q-Learning Automaton
Halifax, Canada
October 13-October 17
ISBN: 0-7695-1931-8
Fei Qian, Hironori Hirata, "Q-Learning Automaton," 2003 IEEE/WIC International Conference on Intelligent Agent Technology (IAT'03), pp. 432, IEEE Computer Society, Los Alamitos, CA, USA, 2003.
DOI: http://doi.ieeecomputersociety.org/10.1109/IAT.2003.1241115
Reinforcement learning is the problem faced by a controller that must learn behavior through trial-and-error interactions with a dynamic environment. The controller's goal is to maximize reward over time by producing an effective mapping of states to actions, called a policy. To model such systems, this paper presents a generalized learning automaton approach with Q-learning behaviors. Computational experiments on pursuit problems show that, compared with Q-learning, the proposed reinforcement scheme obtains better results in terms of convergence speed and memory size.
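For readers unfamiliar with the baseline mentioned in the abstract, the following is a minimal sketch of standard tabular Q-learning, not the authors' learning automaton scheme, which the abstract does not specify. The corridor task, the names `step`, `choose_action`, `train`, and `greedy_policy`, and all parameter values are assumptions chosen for illustration; the paper's pursuit problems are not reproduced here.

```python
import random

N_STATES = 5          # hypothetical toy task: corridor cells 0..4, cell 4 is the goal
ACTIONS = (0, 1)      # 0 = move left, 1 = move right


def step(state, action):
    """Deterministic toy environment: reward 1 only on reaching the goal."""
    nxt = max(0, state - 1) if action == 0 else state + 1
    done = nxt == N_STATES - 1
    return nxt, (1.0 if done else 0.0), done


def choose_action(q, state, epsilon, rng):
    """Epsilon-greedy selection with random tie-breaking."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    best = max(q[state])
    return rng.choice([a for a in ACTIONS if q[state][a] == best])


def train(episodes=200, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Learn a Q-table (one value per state-action pair) by trial and error."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            action = choose_action(q, state, epsilon, rng)
            nxt, reward, done = step(state, action)
            # Q-learning target: immediate reward plus discounted best next value
            # (no bootstrapping from the terminal state).
            target = reward + (0.0 if done else gamma * max(q[nxt]))
            q[state][action] += alpha * (target - q[state][action])
            state = nxt
    return q


def greedy_policy(q):
    """Policy = mapping from each non-goal state to its highest-valued action."""
    return [max(ACTIONS, key=lambda a: q[s][a]) for s in range(N_STATES - 1)]


q_table = train()
print(greedy_policy(q_table))
```

In this toy setting the learned policy should select "move right" in every non-goal cell. The table holds one entry per state-action pair, so its size grows with the product of the state and action counts; that is the memory cost the abstract compares against.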
