Search Sciweavers | Sciweavers

168 search results - page 18 / 34

» Optimism in Reinforcement Learning Based on Kullback-Leibler...

149

click to vote

ICML
2003
IEEE

105views Machine Learning» more ICML 2003»

Principled Methods for Advising Reinforcement Learning Agents

16 years 6 months ago

Download www.hpl.hp.com

An important issue in reinforcement learning is how to incorporate expert knowledge in a principled manner, especially as we scale up to real-world tasks. In this paper, we presen...

Eric Wiewiora, Garrison W. Cottrell, Charles Elkan

claim paper

Read More »

150

click to vote

CORR
2007
Springer

73views Education» more CORR 2007»

Universal Reinforcement Learning

15 years 6 months ago

Download www.stanford.edu

—We consider an agent interacting with an unmodeled environment. At each time, the agent makes an observation, takes an action, and incurs a cost. Its actions can inﬂuence futu...

Vivek F. Farias, Ciamac Cyrus Moallemi, Tsachy Wei...

claim paper

Read More »

163

click to vote

NIPS
1998

137views Information Technology» more NIPS 1998»

Risk Sensitive Reinforcement Learning

15 years 7 months ago

Download www.cs.cmu.edu

In this paper, we consider Markov Decision Processes (MDPs) with error states. Error states are those states entering which is undesirable or dangerous. We define the risk with re...

Ralph Neuneier, Oliver Mihatsch

claim paper

Read More »

208

click to vote

ICML
1995
IEEE

196views Machine Learning» more ICML 1995»

Ant-Q: A Reinforcement Learning Approach to the Traveling Salesman Problem

16 years 6 months ago

Download www.idsia.ch

In this paper we introduce Ant-Q, a family of algorithms which present many similarities with Q-learning (Watkins, 1989), and which we apply to the solution of symmetric and asymm...

Luca Maria Gambardella, Marco Dorigo

claim paper

Read More »

147

click to vote

AAMAS
2005
Springer

126views Intelligent Agents» more AAMAS 2005»

Learning to Coordinate Using Commitment Sequences in Cooperative Multi-agent Systems

15 years 11 months ago

Download como.vub.ac.be

We report on an investigation of the learning of coordination in cooperative multi-agent systems. Speciﬁcally, we study solutions that are applicable to independent agents i.e. ...

Spiros Kapetanakis, Daniel Kudenko, Malcolm J. A. ...

claim paper

Read More »

« Prev « First page 18 / 34 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers