Sciweavers

2990 search results - page 367 / 598
» Hidden Markov processes
Sort
View
ATAL
2009
Springer
16 years 1 months ago
Online exploration in least-squares policy iteration
One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...
Lihong Li, Michael L. Littman, Christopher R. Mans...
ATAL
2007
Springer
16 years 25 days ago
Autonomous nondeterministic tour guides: improving quality of experience with TTD-MDPs
In this paper, we address the problem of building a system of autonomous agents for a complex environment, in our case, a museum with many visitors. Visitors may have varying pref...
Andrew S. Cantino, David L. Roberts, Charles L. Is...
STACS
1997
Springer
15 years 10 months ago
Methods and Applications of (MAX, +) Linear Algebra
Exotic semirings such as the “(max, +) semiring” (R ∪ {−∞}, max, +), or the “tropical semiring” (N ∪ {+∞}, min, +), have been invented and reinvented many times s...
Stephane Gaubert, Max Plus
DEBS
2010
ACM
15 years 10 months ago
Predictive publish/subscribe matching
A new publish/subscribe capability is presented: the ability to predict the likelihood that a subscription will be matched at some point in the future. Composite subscriptions con...
Vinod Muthusamy, Haifeng Liu, Hans-Arno Jacobsen
AI
2006
Springer
15 years 10 months ago
An Efficient Resource Allocation Approach in Real-Time Stochastic Environment
We are interested in contributing to solving effectively a particular type of real-time stochastic resource allocation problem. Firstly, one distinction is that certain tasks may c...
Pierrick Plamondon, Brahim Chaib-draa, Abder Rezak...