Search Sciweavers | Sciweavers

177

ICML
1997
IEEE

194views Machine Learning» more ICML 1997»

Hierarchical Explanation-Based Reinforcement Learning

16 years 7 months ago

Explanation-Based Reinforcement Learning (EBRL) was introduced by Dietterich and Flann as a way of combining the ability of Reinforcement Learning (RL) to learn optimal plans with...

Prasad Tadepalli, Thomas G. Dietterich

claim paper

Read More »

176

click to vote

ICML
1995
IEEE

155views Machine Learning» more ICML 1995»

Stable Function Approximation in Dynamic Programming

16 years 7 months ago

Download www.ri.cmu.edu

The success ofreinforcement learninginpractical problems depends on the ability to combine function approximation with temporal di erence methods such as value iteration. Experime...

Geoffrey J. Gordon

claim paper

Read More »

202

click to vote

WWW
2008
ACM

150views Internet Technology» more WWW 2008»

Algorithm for stochastic multiple-choice knapsack problem and application to keywords bidding

16 years 7 months ago

Download www2008.org

We model budget-constrained keyword bidding in sponsored search auctions as a stochastic multiple-choice knapsack problem (S-MCKP) and design an algorithm to solve S-MCKP and the ...

Yunhong Zhou, Victor Naroditskiy

claim paper

Read More »

191

click to vote

WWW
2008
ACM

125views Internet Technology» more WWW 2008»

R-U-in?: doing what you like, with people whom you like

16 years 7 months ago

Download www2008.org

This paper presents R-U-In? ? a social networking application that leverages Web 2.0 and IMS-based Converged Networks technologies to create a rich next-generation service. R-U-In...

Nilanjan Banerjee, Dipanjan Chakraborty, Koustuv D...

claim paper

Read More »

177

click to vote

WWW
2005
ACM

177views Internet Technology» more WWW 2005»

Adaptive filtering of advertisements on web pages

16 years 7 months ago

Download www.www2005.org

We present a browser extension to dynamically learn to filter unwanted images (such as advertisements or flashy graphics) based on minimal user feedback. To do so, we apply the we...

Babak Esfandiari, Richard Nock

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers