Search Sciweavers | Sciweavers

536 search results - page 32 / 108

» Residual Algorithms: Reinforcement Learning with Function Ap...

178

click to vote

NIPS
2001

131views Information Technology» more NIPS 2001»

The Steering Approach for Multi-Criteria Reinforcement Learning

15 years 7 months ago

Download books.nips.cc

We consider the problem of learning to attain multiple goals in a dynamic environment, which is initially unknown. In addition, the environment may contain arbitrarily varying ele...

Shie Mannor, Nahum Shimkin

claim paper

Read More »

180

click to vote

GECCO
2009
Springer

124views Optimization» more GECCO 2009»

Reinforcement learning for games: failures and successes

15 years 10 months ago

Download www.gm.fh-koeln.de

We apply CMA-ES, an evolution strategy with covariance matrix adaptation, and TDL (Temporal Difference Learning) to reinforcement learning tasks. In both cases these algorithms se...

Wolfgang Konen, Thomas Bartz-Beielstein

claim paper

Read More »

165

click to vote

ICASSP
2011
IEEE

155views Signal Processing» more ICASSP 2011»

Image prediction based on non-negative matrix factorization

14 years 9 months ago

Download mirlab.org

This paper presents a novel spatial texture prediction method based on non-negative matrix factorization. As an extension of template matching, approximation based iterative textu...

Mehmet Türkan, Christine Guillemot

claim paper

Read More »

151

click to vote

ROBOCUP
2007
Springer

102views Robotics» more ROBOCUP 2007»

Heuristic Reinforcement Learning Applied to RoboCup Simulation Agents

16 years 20 hour ago

Download www.fei.edu.br

This paper describes the design and implementation of robotic agents for the RoboCup Simulation 2D category that learns using a recently proposed Heuristic Reinforcement Learning a...

Luiz A. Celiberto, Carlos H. C. Ribeiro, Anna Hele...

claim paper

Read More »

165

click to vote

ECML
2006
Springer

116views Machine Learning» more ECML 2006»

Scaling Model-Based Average-Reward Reinforcement Learning for Product Delivery

15 years 9 months ago

Download web.engr.oregonstate.edu

Reinforcement learning in real-world domains suffers from three curses of dimensionality: explosions in state and action spaces, and high stochasticity. We present approaches that ...

Scott Proper, Prasad Tadepalli

claim paper

Read More »

« Prev « First page 32 / 108 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers