Sciweavers

536 search results - page 32 / 108
» Residual Algorithms: Reinforcement Learning with Function Ap...
Sort
View
NIPS
2001
15 years 7 months ago
The Steering Approach for Multi-Criteria Reinforcement Learning
We consider the problem of learning to attain multiple goals in a dynamic environment, which is initially unknown. In addition, the environment may contain arbitrarily varying ele...
Shie Mannor, Nahum Shimkin
GECCO
2009
Springer
124views Optimization» more  GECCO 2009»
15 years 10 months ago
Reinforcement learning for games: failures and successes
We apply CMA-ES, an evolution strategy with covariance matrix adaptation, and TDL (Temporal Difference Learning) to reinforcement learning tasks. In both cases these algorithms se...
Wolfgang Konen, Thomas Bartz-Beielstein
ICASSP
2011
IEEE
14 years 9 months ago
Image prediction based on non-negative matrix factorization
This paper presents a novel spatial texture prediction method based on non-negative matrix factorization. As an extension of template matching, approximation based iterative textu...
Mehmet Türkan, Christine Guillemot
ROBOCUP
2007
Springer
102views Robotics» more  ROBOCUP 2007»
16 years 20 hour ago
Heuristic Reinforcement Learning Applied to RoboCup Simulation Agents
This paper describes the design and implementation of robotic agents for the RoboCup Simulation 2D category that learns using a recently proposed Heuristic Reinforcement Learning a...
Luiz A. Celiberto, Carlos H. C. Ribeiro, Anna Hele...
ECML
2006
Springer
15 years 9 months ago
Scaling Model-Based Average-Reward Reinforcement Learning for Product Delivery
Reinforcement learning in real-world domains suffers from three curses of dimensionality: explosions in state and action spaces, and high stochasticity. We present approaches that ...
Scott Proper, Prasad Tadepalli