7th IEEE International Conference on Computer and Information Technology (CIT 2007) Performance Evaluation of TD-Learning Methods for Bandwidth Provisioning Aizu-Wakamatsu City, Fukushima, Japan October 16-October 19 ISBN: 0-7695-2983-6
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/CIT.2007.131
Q-learning and SARSA are two methods of TD- learning. Researchers interested in this field proposed the Eligibility concept in order to speed up Q-learning and SARSA. They proved their claim by running the algorithms in a static environment. Authors of this paper have used Q-learning, SARSA and also their eligibility versions for bandwidth provisioning in DiffServ networks that is an absolutely dynamic environment. Performance of these methods in this absolutely dynamic environment is evaluated. Keyword - Queuing delay guarantee, Bandwidth provisioning, DiffServ architecture, TD-learning
Citation:
M. Jahanshahi, M. R. Meybodi, "Performance Evaluation of TD-Learning Methods for Bandwidth Provisioning," cit, pp.171-176, 7th IEEE International Conference on Computer and Information Technology (CIT 2007), 2007 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||