Search Sciweavers | Sciweavers

4255 search results - page 263 / 851

» On Learning Boolean Functions

158

click to vote

ICANN
2009
Springer

121views Neural Networks» more ICANN 2009»

Learning SVMs from Sloppily Labeled Data

15 years 11 months ago

Download www.lif.univ-mrs.fr

This paper proposes a modelling of Support Vector Machine (SVM) learning to address the problem of learning with sloppy labels. In binary classiﬁcation, learning with sloppy labe...

Guillaume Stempfel, Liva Ralaivola

claim paper

Read More »

183

click to vote

ECML
2006
Springer

116views Machine Learning» more ECML 2006»

Scaling Model-Based Average-Reward Reinforcement Learning for Product Delivery

15 years 10 months ago

Download web.engr.oregonstate.edu

Reinforcement learning in real-world domains suffers from three curses of dimensionality: explosions in state and action spaces, and high stochasticity. We present approaches that ...

Scott Proper, Prasad Tadepalli

claim paper

Read More »

205

click to vote

WSC
2007

166views Modeling And Simulation» more WSC 2007»

Optimizing time warp simulation with reinforcement learning techniques

15 years 9 months ago

Download www.informs-sim.org

Adaptive Time Warp protocols in the literature are usually based on a pre-deﬁned analytic model of the system, expressed as a closed form function that maps system state to cont...

Jun Wang, Carl Tropper

claim paper

Read More »

176

click to vote

HEURISTICS
2008

170views more HEURISTICS 2008»

Accelerating autonomous learning by using heuristic selection of actions

15 years 6 months ago

Download www.fei.edu.br

This paper investigates how to make improved action selection for online policy learning in robotic scenarios using reinforcement learning (RL) algorithms. Since finding control po...

Reinaldo A. C. Bianchi, Carlos H. C. Ribeiro, Anna...

claim paper

Read More »

168

click to vote

ICML
2003
IEEE

121views Machine Learning» more ICML 2003»

Q-Decomposition for Reinforcement Learning Agents

16 years 7 months ago

Download www.hpl.hp.com

The paper explores a very simple agent design method called Q-decomposition, wherein a complex agent is built from simpler subagents. Each subagent has its own reward function and...

Stuart J. Russell, Andrew Zimdars

claim paper

Read More »

« Prev « First page 263 / 851 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers