Search Sciweavers | Sciweavers

3502 search results - page 218 / 701

» From Machine Learning to Machine Reasoning

180

click to vote

ECML
2006
Springer

116views Machine Learning» more ECML 2006»

Scaling Model-Based Average-Reward Reinforcement Learning for Product Delivery

15 years 10 months ago

Download web.engr.oregonstate.edu

Reinforcement learning in real-world domains suffers from three curses of dimensionality: explosions in state and action spaces, and high stochasticity. We present approaches that ...

Scott Proper, Prasad Tadepalli

claim paper

Read More »

160

click to vote

ML
1998
ACM

102views Machine Learning» more ML 1998»

Statistical Mechanics of Online Learning of Drifting Concepts: A Variational Approach

15 years 6 months ago

Download fig.if.usp.br

We review the application of statistical mechanics methods to the study of online learning of a drifting concept in the limit of large systems. The model where a feed-forward netwo...

Renato Vicente, Osame Kinouchi, Nestor Caticha

claim paper

Read More »

136

click to vote

ICML
2006
IEEE

131views Machine Learning» more ICML 2006»

Relational temporal difference learning

16 years 7 months ago

Download cll.stanford.edu

We introduce relational temporal difference learning as an effective approach to solving multi-agent Markov decision problems with large state spaces. Our algorithm uses temporal ...

Nima Asgharbeygi, David J. Stracuzzi, Pat Langley

claim paper

Read More »

178

click to vote

ICML
2005
IEEE

100views Machine Learning» more ICML 2005»

Reinforcement learning with Gaussian processes

16 years 7 months ago

Download www.machinelearning.org

Gaussian Process Temporal Difference (GPTD) learning offers a Bayesian solution to the policy evaluation problem of reinforcement learning. In this paper we extend the GPTD framew...

Yaakov Engel, Shie Mannor, Ron Meir

claim paper

Read More »

158

click to vote

ICML
2001
IEEE

126views Machine Learning» more ICML 2001»

Round Robin Rule Learning

16 years 7 months ago

Download www.eecs.wsu.edu

In this paper, we discuss a technique for handling multi-class problems with binary classifiers, namely to learn one classifier for each pair of classes. Although this idea is kno...

Johannes Fürnkranz

claim paper

Read More »

« Prev « First page 218 / 701 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers