Sciweavers

8232 search results - page 376 / 1647
» Dynamic Logic Programming
Sort
View
CDC
2010
IEEE
136views Control Systems» more  CDC 2010»
15 years 1 months ago
Pathologies of temporal difference methods in approximate dynamic programming
Approximate policy iteration methods based on temporal differences are popular in practice, and have been tested extensively, dating to the early nineties, but the associated conve...
Dimitri P. Bertsekas
181
Voted
CDC
2010
IEEE
139views Control Systems» more  CDC 2010»
15 years 1 months ago
Q-learning and enhanced policy iteration in discounted dynamic programming
We consider the classical finite-state discounted Markovian decision problem, and we introduce a new policy iteration-like algorithm for finding the optimal state costs or Q-facto...
Dimitri P. Bertsekas, Huizhen Yu
INFORMS
2011
120views more  INFORMS 2011»
15 years 1 months ago
Solving Talent Scheduling with Dynamic Programming
Maria Garcia de la Banda, Peter J. Stuckey, Geoffr...