Search Sciweavers | Sciweavers

1760 search results - page 7 / 352

» Learning from Partial Observations

107

Voted

NIPS
1994

89views Information Technology» more NIPS 1994»

Reinforcement Learning Algorithm for Partially Observable Markov Decision Problems

15 years 7 months ago

Download www.eecs.umich.edu

Tommi Jaakkola, Satinder P. Singh, Michael I. Jord...

claim paper

Read More »

129

Voted

COLT
2000
Springer

87views Machine Learning» more COLT 2000»

Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

15 years 10 months ago

Download www.cs.iastate.edu

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process ( ¢¡¤£¦¥§ ), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter

claim paper

Read More »

214

click to vote

AAAI
2012

205views Intelligent Agents» more AAAI 2012»

Competing with Humans at Fantasy Football: Team Formation in Large Partially-Observable Domains

13 years 8 months ago

Download www.intelligence.tuc.gr

We present the ﬁrst real-world benchmark for sequentiallyoptimal team formation, working within the framework of a class of online football prediction games known as Fantasy Foo...

Tim Matthews, Sarvapali D. Ramchurn, Georgios Chal...

claim paper

Read More »

122

click to vote

ICML
2010
IEEE

175views Machine Learning» more ICML 2010»

Telling cause from effect based on high-dimensional observations

15 years 6 months ago

Download www.kyb.tuebingen.mpg.de

Dominik Janzing, Patrik O. Hoyer, Bernhard Sch&oum...

claim paper

Read More »

172

click to vote

COLING
2010

138views Computational Linguistics» more COLING 2010»

Controlling Listening-oriented Dialogue using Partially Observable Markov Decision Processes

15 years 24 days ago

Download aclweb.org

This paper investigates how to automatically create a dialogue control component of a listening agent to reduce the current high cost of manually creating such components. We coll...

Toyomi Meguro, Ryuichiro Higashinaka, Yasuhiro Min...

claim paper

Read More »

« Prev « First page 7 / 352 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers