7th IEEE International Conference on Computer and Information Technology (CIT 2007)
Performance Evaluation of TD-Learning Methods for Bandwidth Provisioning
Aizu-Wakamatsu City, Fukushima, Japan
October 16-October 19
ISBN: 0-7695-2983-6
Q-learning and SARSA are two methods of TD- learning. Researchers interested in this field proposed the Eligibility concept in order to speed up Q-learning and SARSA. They proved their claim by running the algorithms in a static environment. Authors of this paper have used Q-learning, SARSA and also their eligibility versions for bandwidth provisioning in DiffServ networks that is an absolutely dynamic environment. Performance of these methods in this absolutely dynamic environment is evaluated. Keyword - Queuing delay guarantee, Bandwidth provisioning, DiffServ architecture, TD-learning
Citation:
M. Jahanshahi, M. R. Meybodi, "Performance Evaluation of TD-Learning Methods for Bandwidth Provisioning," cit, pp.171-176, 7th IEEE International Conference on Computer and Information Technology (CIT 2007), 2007