Search Sciweavers | Sciweavers

2223 search results - page 249 / 445

» Implicit Online Learning

215

click to vote

CVPR
2010
IEEE

492views Computer Vision» more CVPR 2010»

P-N learning: Bootstrapping binary classifiers by structural constraints

15 years 4 months ago

Download www.ee.surrey.ac.uk

This paper shows that the performance of a binary classifier can be significantly improved by the processing of structured unlabeled data, i.e. data are structured if knowing the ...

Zdenek Kalal, Jiri Matas, Krystian Mikolajczyk

claim paper

Read More »

178

Voted

NIPS
2007

164views Information Technology» more NIPS 2007»

Incremental Natural Actor-Critic Algorithms

15 years 8 months ago

Download books.nips.cc

We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...

Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...

claim paper

Read More »

207

click to vote

COLT
2010
Springer

183views Machine Learning» more COLT 2010»

Regret Minimization With Concept Drift

15 years 4 months ago

Download www.seas.upenn.edu

In standard online learning, the goal of the learner is to maintain an average loss that is "not too big" compared to the loss of the best-performing function in a fixed...

Koby Crammer, Yishay Mansour, Eyal Even-Dar, Jenni...

claim paper

Read More »

146

click to vote

ICRA
2008
IEEE

119views Robotics» more ICRA 2008»

Towards schema-based, constructivist robot learning: Validating an evolutionary search algorithm for schema chunking

16 years 1 months ago

Download www.cs.utk.edu

— In this paper, we lay the groundwork for extending our previously developed ASyMTRe architecture to enable constructivist learning for multi-robot team tasks. The ASyMTRe archi...

Yifan Tang, Lynne E. Parker

claim paper

Read More »

176

click to vote

IWANN
1999
Springer

115views Neural Networks» more IWANN 1999»

Using Temporal Neighborhoods to Adapt Function Approximators in Reinforcement Learning

15 years 11 months ago

Download www.cs.colostate.edu

To avoid the curse of dimensionality, function approximators are used in reinforcement learning to learn value functions for individual states. In order to make better use of comp...

R. Matthew Kretchmar, Charles W. Anderson

claim paper

Read More »

« Prev « First page 249 / 445 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers