Search Sciweavers | Sciweavers

829 search results - page 91 / 166

» A time aggregation approach to Markov decision processes

IJCAI
2003

123 views, Artificial Intelligence

Automated Generation of Understandable Contingency Plans

15 years 7 months ago

Download anytime.cs.umass.edu

Markov decision processes (MDPs) and contingency planning (CP) are two widely used approaches to planning under uncertainty. MDPs are attractive because the model is extremely gen...

Max Horstmann, Shlomo Zilberstein

ICMLA
2009

171 views, Machine Learning

Multiagent Transfer Learning via Assignment-Based Decomposition

15 years 4 months ago

Download web.engr.oregonstate.edu

We describe a system that successfully transfers value function knowledge across multiple subdomains of realtime strategy games in the context of multiagent reinforcement learning....

Scott Proper, Prasad Tadepalli

ICCV
2009
IEEE

442 views, Computer Vision

Weakly supervised discriminative localization and classification: a joint learning process

16 years 11 months ago

Download www.ri.cmu.edu

Visual categorization problems, such as object classification or action recognition, are increasingly often approached using a detection strategy: a classifier function is first ...

Minh Hoai Nguyen, Lorenzo Torresani, Fernando de l...

KES
2004
Springer

165 views, Information Technology

Decision Support System on the Grid

15 years 11 months ago

Download www.drts.co.uk

Aero engines are extremely reliable machines and operational failures are rare. However, currently great effort is being put into reducing the number of in-flight engine shutdowns,...

Max Ong, Xiaoxu Ren, J. Allan, Visakan Kadirkamana...

ICML
2003
IEEE

124 views, Machine Learning

Exploration in Metric State Spaces

16 years 7 months ago

Download www.cis.upenn.edu

We present metric-E³, a provably near-optimal algorithm for reinforcement learning in Markov decision processes in which there is a natural metric on the state space that allows t...

Sham Kakade, Michael J. Kearns, John Langford
