Sciweavers

1760 search results - page 7 / 352
» Learning from Partial Observations
Sort
View
107
Voted
NIPS
1994
15 years 7 months ago
Reinforcement Learning Algorithm for Partially Observable Markov Decision Problems
Tommi Jaakkola, Satinder P. Singh, Michael I. Jord...
129
Voted
COLT
2000
Springer
15 years 10 months ago
Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning
We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process (  ¢¡¤£¦¥§  ), and focus on gradient ascent approache...
Peter L. Bartlett, Jonathan Baxter
AAAI
2012
13 years 8 months ago
Competing with Humans at Fantasy Football: Team Formation in Large Partially-Observable Domains
We present the first real-world benchmark for sequentiallyoptimal team formation, working within the framework of a class of online football prediction games known as Fantasy Foo...
Tim Matthews, Sarvapali D. Ramchurn, Georgios Chal...
ICML
2010
IEEE
15 years 6 months ago
Telling cause from effect based on high-dimensional observations
Dominik Janzing, Patrik O. Hoyer, Bernhard Sch&oum...
COLING
2010
15 years 24 days ago
Controlling Listening-oriented Dialogue using Partially Observable Markov Decision Processes
This paper investigates how to automatically create a dialogue control component of a listening agent to reduce the current high cost of manually creating such components. We coll...
Toyomi Meguro, Ryuichiro Higashinaka, Yasuhiro Min...