Search Sciweavers | Sciweavers

196

EWRL
2008

186views Machine Learning» more EWRL 2008»

Efficient Reinforcement Learning in Parameterized Models: Discrete Parameter Case

15 years 8 months ago

We consider reinforcement learning in the parameterized setup, where the model is known to belong to a parameterized family of Markov Decision Processes (MDPs). We further impose ...

Kirill Dyagilev, Shie Mannor, Nahum Shimkin

claim paper

Read More »

184

click to vote

AAAI
2010

201views Intelligent Agents» more AAAI 2010»

Compressing POMDPs Using Locality Preserving Non-Negative Matrix Factorization

15 years 8 months ago

Download www.cs.umass.edu

Partially Observable Markov Decision Processes (POMDPs) are a well-established and rigorous framework for sequential decision-making under uncertainty. POMDPs are well-known to be...

Georgios Theocharous, Sridhar Mahadevan

claim paper

Read More »

171

click to vote

UAI
2008

230views Artificial Intelligence» more UAI 2008»

Partitioned Linear Programming Approximations for MDPs

15 years 8 months ago

Download uai2008.cs.helsinki.fi

Approximate linear programming (ALP) is an efficient approach to solving large factored Markov decision processes (MDPs). The main idea of the method is to approximate the optimal...

Branislav Kveton, Milos Hauskrecht

claim paper

Read More »

178

click to vote

AAAI
2006

86views Intelligent Agents» more AAAI 2006»

Targeting Specific Distributions of Trajectories in MDPs

15 years 8 months ago

Download www.cc.gatech.edu

We define TTD-MDPs, a novel class of Markov decision processes where the traditional goal of an agent is changed from finding an optimal trajectory through a state space to realiz...

David L. Roberts, Mark J. Nelson, Charles Lee Isbe...

claim paper

Read More »

143

click to vote

UAI
2004

101views Artificial Intelligence» more UAI 2004»

Region-Based Incremental Pruning for POMDPs

15 years 8 months ago

Download anytime.cs.umass.edu

We present a major improvement to the incremental pruning algorithm for solving partially observable Markov decision processes. Our technique targets the cross-sum step of the dyn...

Zhengzhu Feng, Shlomo Zilberstein

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers