loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
2007 Frontiers in the Convergence of Bioscience and Information Technologies
Bounded Incremental Real-Time Dynamic Programming
Jeju Island, Korea
October 11-October 13
ISBN: 978-0-7695-2999-8
A real-time multi-step planning problem is characterized by alternating decision-making and execution processes, whole online decision-making time divided in slices between each execution, and the pressing need for policy that only relates to current step. We propose a new criterion to judge the optimality of a policy based on the upper and lower bound theory. This criterion guarantees that the agent can act earlier in a real-time decision process while an optimal policy with sufficient precision still remains. We prove that, under certain conditions, one can obtain an optimal policy with arbitrary precision using such an incremental method. We present a Bounded Incremental Real-Time Dynamic Programming algorithm (BIRTDP). In the experiments of two typical real-time simulation systems, BIRTDP outperforms the other state-of-the-art RTDP algorithms tested.
Citation:
Changjie Fan, Xiaoping Chen, "Bounded Incremental Real-Time Dynamic Programming," fbit, pp.637-644, 2007 Frontiers in the Convergence of Bioscience and Information Technologies, 2007
Usage of this product signifies your acceptance of the Terms of Use.