Search Sciweavers

168 search results - page 17 / 34

» Optimism in Reinforcement Learning Based on Kullback-Leibler...

AAAI
2010

134 views, Intelligent Agents

Reinforcement Learning Via Practice and Critique Advice

15 years 7 months ago

Download web.engr.oregonstate.edu

We consider the problem of incorporating end-user advice into reinforcement learning (RL). In our setting, the learner alternates between practicing, where learning is based on ac...

Kshitij Judah, Saikat Roy, Alan Fern, Thomas G. Di...

CVPR
2011
IEEE

446 views, Computer Vision

Shape Grammar Parsing via Reinforcement Learning

15 years 2 months ago

Download www.mas.ecp.fr

This paper tackles shape grammar parsing for facade segmentation using a novel optimization approach based on reinforcement learning (RL). To this end, we use a binary recursive g...

Olivier Teboul, Iasonas Kokkinos, Panagiotis Kouts...

ICRA
2008
IEEE

173 views, Robotics

Bayesian reinforcement learning in continuous POMDPs with application to robot navigation

16 years 15 days ago

Download www.cs.cmu.edu

We consider the problem of optimal control in continuous and partially observable environments when the parameters of the model are not known exactly. Partially Observable Mark...

Stéphane Ross, Brahim Chaib-draa, Joelle Pi...

AIMSA
2004
Springer

104 views, Artificial Intelligence

Towards Well-Defined Multi-agent Reinforcement Learning

15 years 9 months ago

Download userweb.port.ac.uk

Multi-agent reinforcement learning (MARL) is an emerging area of research. However, it lacks two important elements: a coherent view on MARL, and a well-defined problem objective. ...

Rinat Khoussainov

JMLR
2010

125 views

Variational methods for Reinforcement Learning

15 years 26 days ago

Download jmlr.csail.mit.edu

We consider reinforcement learning as solving a Markov decision process with unknown transition distribution. Based on interaction with the environment, an estimate of the transit...

Thomas Furmston, David Barber
