Search Sciweavers | Sciweavers

3502 search results - page 217 / 701

» From Machine Learning to Machine Reasoning

166

click to vote

ICML
1995
IEEE

155views Machine Learning» more ICML 1995»

Stable Function Approximation in Dynamic Programming

16 years 7 months ago

Download www.ri.cmu.edu

The success ofreinforcement learninginpractical problems depends on the ability to combine function approximation with temporal di erence methods such as value iteration. Experime...

Geoffrey J. Gordon

claim paper

Read More »

159

click to vote

NIPS
2007

124views Information Technology» more NIPS 2007»

A Risk Minimization Principle for a Class of Parzen Estimators

15 years 8 months ago

Download books.nips.cc

This paper1 explores the use of a Maximal Average Margin (MAM) optimality principle for the design of learning algorithms. It is shown that the application of this risk minimizati...

Kristiaan Pelckmans, Johan A. K. Suykens, Bart De ...

claim paper

Read More »

164

click to vote

COLT
2003
Springer

104views Machine Learning» more COLT 2003»

Learning Random Log-Depth Decision Trees under the Uniform Distribution

15 years 11 months ago

Download www.cs.columbia.edu

We consider three natural models of random logarithmic depth decision trees over Boolean variables. We give an eﬃcient algorithm that for each of these models learns all but an ...

Jeffrey C. Jackson, Rocco A. Servedio

claim paper

Read More »

184

click to vote

ICML
2008
IEEE

157views Machine Learning» more ICML 2008»

Efficiently learning linear-linear exponential family predictive representations of state

16 years 7 months ago

Download web.mit.edu

Exponential Family PSR (EFPSR) models capture stochastic dynamical systems by representing state as the parameters of an exponential family distribution over a shortterm window of...

David Wingate, Satinder P. Singh

claim paper

Read More »

181

click to vote

ICML
2000
IEEE

165views Machine Learning» more ICML 2000»

A Bayesian Framework for Reinforcement Learning

15 years 11 months ago

Download www.ece.uvic.ca

The reinforcement learning problem can be decomposed into two parallel types of inference: (i) estimating the parameters of a model for the underlying process; (ii) determining be...

Malcolm J. A. Strens

claim paper

Read More »

« Prev « First page 217 / 701 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers