Sciweavers

3412 search results - page 131 / 683
» Efficient Reinforcement Learning
Sort
View
PRICAI
2000
Springer
15 years 10 months ago
Generating Hierarchical Structure in Reinforcement Learning from State Variables
This paper presents the CQ algorithm which decomposes and solves a Markov Decision Process (MDP) by automatically generating a hierarchy of smaller MDPs using state variables. The ...
Bernhard Hengst