Search Sciweavers | Sciweavers

1974 search results - page 238 / 395

» Online learning in online auctions

IWANN
1999
Springer

115 views, Neural Networks

Using Temporal Neighborhoods to Adapt Function Approximators in Reinforcement Learning

15 years 11 months ago

Download www.cs.colostate.edu

To avoid the curse of dimensionality, function approximators are used in reinforcement learning to learn value functions for individual states. In order to make better use of comp...

R. Matthew Kretchmar, Charles W. Anderson

STOC
2005
ACM

129 views, Algorithms

Learning with attribute costs

16 years 7 months ago

Download www.math.tau.ac.il

We study an extension of the "standard" learning models to settings where observing the value of an attribute has an associated cost (which might be different for differ...

Haim Kaplan, Eyal Kushilevitz, Yishay Mansour

ISCA
2006
IEEE

138 views, Hardware

Learning-Based SMT Processor Resource Distribution via Hill-Climbing

16 years 19 days ago

Download maggini.eng.umd.edu

The key to high performance in Simultaneous Multithreaded (SMT) processors lies in optimizing the distribution of shared resources to active threads. Existing resource distributio...

Seungryul Choi, Donald Yeung

SBIA
2004
Springer

113 views, Artificial Intelligence

Learning with Drift Detection

15 years 12 months ago

Download www2.mat.ua.pt

Most of the work in machine learning assumes that examples are generated at random according to some stationary probability distribution. In this work we study the problem...

João Gama, Pedro Medas, Gladys Castillo, Pe...

NIPS
1996

192 views, Information Technology

Multidimensional Triangulation and Interpolation for Reinforcement Learning

15 years 8 months ago

Download www.cs.cmu.edu

Dynamic Programming, Q-learning and other discrete Markov Decision Process solvers can be applied to continuous d-dimensional state-spaces by quantizing the state space into an arr...

Scott Davies
