Search Sciweavers | Sciweavers

172

ATAL
2009
Springer

146views Intelligent Agents» more ATAL 2009»

Online exploration in least-squares policy iteration

16 years 1 months ago

One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...

Lihong Li, Michael L. Littman, Christopher R. Mans...

claim paper

Read More »

172

click to vote

ATAL
2007
Springer

110views Intelligent Agents» more ATAL 2007»

Autonomous nondeterministic tour guides: improving quality of experience with TTD-MDPs

16 years 25 days ago

Download andrewcantino.com

In this paper, we address the problem of building a system of autonomous agents for a complex environment, in our case, a museum with many visitors. Visitors may have varying pref...

Andrew S. Cantino, David L. Roberts, Charles L. Is...

claim paper

Read More »

175

click to vote

STACS
1997
Springer

137views Theoretical Computer Science» more STACS 1997»

Methods and Applications of (MAX, +) Linear Algebra

15 years 10 months ago

Download www-rocq.inria.fr

Exotic semirings such as the “(max, +) semiring” (R ∪ {−∞}, max, +), or the “tropical semiring” (N ∪ {+∞}, min, +), have been invented and reinvented many times s...

Stephane Gaubert, Max Plus

claim paper

Read More »

165

click to vote

DEBS
2010
ACM

102views Distributed And Parallel Com...» more DEBS 2010»

Predictive publish/subscribe matching

15 years 10 months ago

Download www.eecg.toronto.edu

A new publish/subscribe capability is presented: the ability to predict the likelihood that a subscription will be matched at some point in the future. Composite subscriptions con...

Vinod Muthusamy, Haifeng Liu, Hans-Arno Jacobsen

claim paper

Read More »

164

click to vote

AI
2006
Springer

110views Artificial Intelligence» more AI 2006»

An Efficient Resource Allocation Approach in Real-Time Stochastic Environment

15 years 10 months ago

Download www.damas.ift.ulaval.ca

We are interested in contributing to solving effectively a particular type of real-time stochastic resource allocation problem. Firstly, one distinction is that certain tasks may c...

Pierrick Plamondon, Brahim Chaib-draa, Abder Rezak...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers