Search Sciweavers | Sciweavers

829 search results - page 98 / 166

» A time aggregation approach to Markov decision processes


Publication


Monte Carlo Value Iteration for Continuous-State POMDPs

15 years 1 month ago

Download www.comp.nus.edu.sg

Partially observable Markov decision processes (POMDPs) have been successfully applied to various robot motion planning tasks under uncertainty. However, most existing POMDP algo...

Haoyu Bai, David Hsu, Wee Sun Lee, and Vien A. Ngo

posted by bhy



JMLR 2010

Adaptive Step-size Policy Gradients with Average Reward Metric

15 years 1 month ago

Download jmlr.csail.mit.edu

In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...

Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...



AAAI 2011 (Intelligent Agents)

Linear Dynamic Programs for Resource Management

14 years 6 months ago

Download www.cs.umass.edu

Sustainable resource management in many domains presents large continuous stochastic optimization problems, which can often be modeled as Markov decision processes (MDPs). To solv...

Marek Petrik, Shlomo Zilberstein



ICIP 2005, IEEE (Image Processing)

Content-based video copy detection in large databases: a local fingerprints statistical similarity search approach

16 years 7 months ago

Download obuisson.free.fr

Recent methods based on interest points and local fingerprints have been proposed to perform robust CBCD (content-based copy detection) of images and video. They include two steps...

Alexis Joly, Carl Frélicot, Olivier Buisson



ICRA 2010, IEEE (Robotics)

Exploiting domain knowledge in planning for uncertain robot systems modeled as POMDPs

15 years 4 months ago

Download robotics.ai.uiuc.edu

We propose a planning algorithm that allows user-supplied domain knowledge to be exploited in the synthesis of information feedback policies for systems modeled as parti...

Salvatore Candido, James C. Davidson, Seth Hutchin...
