Search Sciweavers | Sciweavers

82 search results - page 8 / 17

» Balancing Exploration and Exploitation in Learning to Rank O...

ICML
2006
IEEE

136 views, Machine Learning

An analytic solution to discrete Bayesian reinforcement learning

16 years 6 months ago

Reinforcement learning (RL) was originally proposed as a framework to allow agents to learn in an online fashion as they interact with their environment. Existing RL algorithms co...

Pascal Poupart, Nikos A. Vlassis, Jesse Hoey, Kevi...

ISPASS
2009
IEEE

160 views, Software Engineering

Machine learning based online performance prediction for runtime parallelization and task scheduling

16 years 25 days ago

With the emerging many-core paradigm, parallel programming must extend beyond its traditional realm of scientific applications. Converting existing sequential applications as w...

Jiangtian Li, Xiaosong Ma, Karan Singh, Martin Sch...

COLT
2008
Springer

179 views, Machine Learning

Adapting to a Changing Environment: the Brownian Restless Bandits

15 years 7 months ago

In the multi-armed bandit (MAB) problem there are k distributions associated with the rewards of playing each of k strategies (slot machine arms). The reward distributions are ini...

Aleksandrs Slivkins, Eli Upfal

IJCAI
2007

201 views, Artificial Intelligence

Using Linear Programming for Bayesian Exploration in Markov Decision Processes

15 years 7 months ago

A key problem in reinforcement learning is finding a good balance between the need to explore the environment and the need to gain rewards by exploiting existing knowledge. Much ...

Pablo Samuel Castro, Doina Precup

CVPR
2008
IEEE

158 views, Computer Vision

Information-theoretic active scene exploration

16 years 8 months ago

Studies support the need for high-resolution imagery to identify persons in surveillance videos [13]. However, the use of telephoto lenses sacrifices a wider field of view and ther...

Eric Sommerlade, Ian Reid
