Search Sciweavers | Sciweavers

1760 search results - page 73 / 352

» Learning from Partial Observations

158

click to vote

COLT
1998
Springer

105views Machine Learning» more COLT 1998»

Self Bounding Learning Algorithms

15 years 10 months ago

Download cseweb.ucsd.edu

Most of the work which attempts to give bounds on the generalization error of the hypothesis generated by a learning algorithm is based on methods from the theory of uniform conve...

Yoav Freund

claim paper

Read More »

157

click to vote

AAAI
2010

171views Intelligent Agents» more AAAI 2010»

Reinforcement Learning via AIXI Approximation

15 years 7 months ago

Download jveness.info

This paper introduces a principled approach for the design of a scalable general reinforcement learning agent. This approach is based on a direct approximation of AIXI, a Bayesian...

Joel Veness, Kee Siong Ng, Marcus Hutter, David Si...

claim paper

Read More »

142

click to vote

UAI
2001

129views Artificial Intelligence» more UAI 2001»

The Optimal Reward Baseline for Gradient-Based Reinforcement Learning

15 years 7 months ago

Download cs.anu.edu.au

There exist a number of reinforcement learning algorithms which learn by climbing the gradient of expected reward. Their long-run convergence has been proved, even in partially ob...

Lex Weaver, Nigel Tao

claim paper

Read More »

161

click to vote

AIMSA
2004
Springer

104views Artificial Intelligence» more AIMSA 2004»

Towards Well-Defined Multi-agent Reinforcement Learning

15 years 10 months ago

Download userweb.port.ac.uk

Multi-agent reinforcement learning (MARL) is an emerging area of research. However, it lacks two important elements: a coherent view on MARL, and a well-defined problem objective. ...

Rinat Khoussainov

claim paper

Read More »

155

click to vote

CORR
2010
Springer

146views Education» more CORR 2010»

Adaptive Submodularity: A New Approach to Active Learning and Stochastic Optimization

15 years 6 months ago

Download www.cs.caltech.edu

Solving stochastic optimization problems under partial observability, where one needs to adaptively make decisions with uncertain outcomes, is a fundamental but notoriously diffic...

Daniel Golovin, Andreas Krause

claim paper

Read More »

« Prev « First page 73 / 352 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers