2003 IEEE/WIC International Conference on Intelligent Agent Technology (IAT'03) Q-Learning Automaton Halifax, Canada October 13-October 17 ISBN: 0-7695-1931-8
Reinforcement Learning is the problem faced by a controller that must learn behavior through trial and error interactions with a dynamic environment. The controller's goal is to maximize reward over time, by producing an effective mapping of states to actions called policy. To construct the model of such systems, in this paper, we present a generalized learning automaton approach with Q-learning behaviors.Comparing to Q-learning, the computational experiments of the pursuit problems show that proposed reinforcement scheme obtains better results in terms of convergence speed and memory size.
Citation:
Fei Qian, Hironori Hirata, "Q-Learning Automaton," iat, pp.432, 2003 IEEE/WIC International Conference on Intelligent Agent Technology (IAT'03), 2003 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||