Search Sciweavers | Sciweavers

2467 search results - page 301 / 494

» Finite State Machines

158

click to vote

ICML
2006
IEEE

142views Machine Learning» more ICML 2006»

An intrinsic reward mechanism for efficient exploration

16 years 7 months ago

Download www-anw.cs.umass.edu

How should a reinforcement learning agent act if its sole purpose is to efficiently learn an optimal policy for later use? In other words, how should it explore, to be able to exp...

Özgür Simsek, Andrew G. Barto

claim paper

Read More »

174

click to vote

ICML
2005
IEEE

195views Machine Learning» more ICML 2005»

Active learning for Hidden Markov Models: objective functions and algorithms

16 years 7 months ago

Download www.cs.cmu.edu

Hidden Markov Models (HMMs) model sequential data in many fields such as text/speech processing and biosignal analysis. Active learning algorithms learn faster and/or better by cl...

Brigham Anderson, Andrew Moore

claim paper

Read More »

129

click to vote

ICML
2005
IEEE

104views Machine Learning» more ICML 2005»

Fast condensed nearest neighbor rule

16 years 7 months ago

Download www.machinelearning.org

We present a novel algorithm for computing a training set consistent subset for the nearest neighbor decision rule. The algorithm, called FCNN rule, has some desirable properties....

Fabrizio Angiulli

claim paper

Read More »

178

click to vote

ICML
2005
IEEE

100views Machine Learning» more ICML 2005»

Reinforcement learning with Gaussian processes

16 years 7 months ago

Download www.machinelearning.org

Gaussian Process Temporal Difference (GPTD) learning offers a Bayesian solution to the policy evaluation problem of reinforcement learning. In this paper we extend the GPTD framew...

Yaakov Engel, Shie Mannor, Ron Meir

claim paper

Read More »

159

click to vote

ICML
2003
IEEE

168views Machine Learning» more ICML 2003»

Bayes Meets Bellman: The Gaussian Process Approach to Temporal Difference Learning

16 years 7 months ago

Download webee.technion.ac.il

We present a novel Bayesian approach to the problem of value function estimation in continuous state spaces. We define a probabilistic generative model for the value function by i...

Yaakov Engel, Shie Mannor, Ron Meir

claim paper

Read More »

« Prev « First page 301 / 494 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers