Sciweavers

829 search results - page 91 / 166
» A time aggregation approach to Markov decision processes
Sort
View
IJCAI
2003
15 years 7 months ago
Automated Generation of Understandable Contingency Plans
Markov decision processes (MDPs) and contingency planning (CP) are two widely used approaches to planning under uncertainty. MDPs are attractive because the model is extremely gen...
Max Horstmann, Shlomo Zilberstein
ICMLA
2009
15 years 4 months ago
Multiagent Transfer Learning via Assignment-Based Decomposition
We describe a system that successfully transfers value function knowledge across multiple subdomains of realtime strategy games in the context of multiagent reinforcement learning....
Scott Proper, Prasad Tadepalli
ICCV
2009
IEEE
16 years 11 months ago
Weakly supervised discriminative localization and classification: a joint learning process
Visual categorization problems, such as object classification or action recognition, are increasingly often approached using a detection strategy: a classifier function is first ...
Minh Hoai Nguyen, Lorenzo Torresani, Fernando de l...
KES
2004
Springer
15 years 11 months ago
Decision Support System on the Grid
Aero engines are extremely reliable machines and operational failures are rare. However, currently great effort is being put into reducing the number of in-flight engine shutdowns,...
Max Ong, Xiaoxu Ren, J. Allan, Visakan Kadirkamana...
ICML
2003
IEEE
16 years 7 months ago
Exploration in Metric State Spaces
We present metric?? , a provably near-optimal algorithm for reinforcement learning in Markov decision processes in which there is a natural metric on the state space that allows t...
Sham Kakade, Michael J. Kearns, John Langford