Search Sciweavers | Sciweavers

181 search results - page 28 / 37

» On Policy Learning in Restricted Policy Spaces

191

click to vote

ICML
1996
IEEE

196views Machine Learning» more ICML 1996»

A Convergent Reinforcement Learning Algorithm in the Continuous Case: The Finite-Element Reinforcement Learning

15 years 10 months ago

Download www.ri.cmu.edu

This paper presents a direct reinforcement learning algorithm, called Finite-Element Reinforcement Learning, in the continuous case, i.e. continuous state-space and time. The eval...

Rémi Munos

claim paper

Read More »

150

click to vote

ICML
2004
IEEE

158views Machine Learning» more ICML 2004»

Adaptive cognitive orthotics: combining reinforcement learning and constraint-based temporal reasoning

16 years 7 months ago

Download www.eecs.umich.edu

Reminder systems support people with impaired prospective memory and/or executive function, by providing them with reminders of their functional daily activities. We integrate tem...

Matthew R. Rudary, Satinder P. Singh, Martha E. Po...

claim paper

Read More »

157

click to vote

ECML
2004
Springer

100views Machine Learning» more ECML 2004»

Dynamic Asset Allocation Exploiting Predictors in Reinforcement Learning Framework

15 years 11 months ago

Download bi.snu.ac.kr

Given the pattern-based multi-predictors of the stock price, we study a method of dynamic asset allocation to maximize the trading performance. To optimize the proportion of asset ...

Jangmin O, Jae Won Lee, Jongwoo Lee, Byoung-Tak Zh...

claim paper

Read More »

157

click to vote

ECML
2007
Springer

133views Machine Learning» more ECML 2007»

Imitation Learning Using Graphical Models

16 years 9 days ago

Download www.cs.washington.edu

Imitation-based learning is a general mechanism for rapid acquisition of new behaviors in autonomous agents and robots. In this paper, we propose a new approach to learning by imit...

Deepak Verma, Rajesh P. N. Rao

claim paper

Read More »

163

click to vote

EUROCAST
2007
Springer

182views Hardware» more EUROCAST 2007»

A k-NN Based Perception Scheme for Reinforcement Learning

16 years 9 days ago

Download www.dia.fi.upm.es

Abstract a paradigm of modern Machine Learning (ML) which uses rewards and punishments to guide the learning process. One of the central ideas of RL is learning by “direct-online...

José Antonio Martin H., Javier de Lope Asia...

claim paper

Read More »

« Prev « First page 28 / 37 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers