Search Sciweavers | Sciweavers

1912 search results - page 328 / 383

» Optimizing interconnection policies

181

click to vote

ICML
2007
IEEE

141views Machine Learning» more ICML 2007»

Reinforcement learning by reward-weighted regression for operational space control

16 years 7 months ago

Download www.machinelearning.org

Many robot control problems of practical importance, including operational space control, can be reformulated as immediate reward reinforcement learning problems. However, few of ...

Jan Peters, Stefan Schaal

claim paper

Read More »

174

click to vote

ICML
2006
IEEE

156views Machine Learning» more ICML 2006»

Learning the structure of Factored Markov Decision Processes in reinforcement learning problems

16 years 7 months ago

Download animatlab.lip6.fr

Recent decision-theoric planning algorithms are able to find optimal solutions in large problems, using Factored Markov Decision Processes (fmdps). However, these algorithms need ...

Thomas Degris, Olivier Sigaud, Pierre-Henri Wuille...

claim paper

Read More »

151

click to vote

ICML
2003
IEEE

121views Machine Learning» more ICML 2003»

Q-Decomposition for Reinforcement Learning Agents

16 years 7 months ago

Download www.hpl.hp.com

The paper explores a very simple agent design method called Q-decomposition, wherein a complex agent is built from simpler subagents. Each subagent has its own reward function and...

Stuart J. Russell, Andrew Zimdars

claim paper

Read More »

149

click to vote

ICML
2003
IEEE

105views Machine Learning» more ICML 2003»

Principled Methods for Advising Reinforcement Learning Agents

16 years 7 months ago

Download www.hpl.hp.com

An important issue in reinforcement learning is how to incorporate expert knowledge in a principled manner, especially as we scale up to real-world tasks. In this paper, we presen...

Eric Wiewiora, Garrison W. Cottrell, Charles Elkan

claim paper

Read More »

175

click to vote

GLOBECOM
2009
IEEE

155views Communications» more GLOBECOM 2009»

Cognitive Radio Enhancements for Legacy Networks Using Cooperative Diversity

16 years 28 days ago

Download www.nd.edu

Abstract— Two driving goals for cognitive radio (CR) technique are maximizing spectrum utilization and avoiding interference to primary users. In this paper, we deal with the CR ...

Zhanwei Sun, Ioannis Krikidis, J. Nicholas Laneman...

claim paper

Read More »

« Prev « First page 328 / 383 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers