Search Sciweavers | Sciweavers

» A time aggregation approach to Markov decision processes

CORR
2010
Springer

Hybrid Numerical Solution of the Chemical Master Equation

15 years 6 months ago

Download alma.cs.uni-sb.de

We present a numerical approximation technique for the analysis of continuous-time Markov chains that describe networks of biochemical reactions and play an important role in the ...

Thomas A. Henzinger, Maria Mateescu, Linar Mikeev,...

CORR
2006
Springer

A Unified View of TD Algorithms; Introducing Full-Gradient TD and Equi-Gradient Descent TD

15 years 6 months ago

Download hal.inria.fr

This paper addresses the issue of policy evaluation in Markov Decision Processes, using linear function approximation. It provides a unified view of algorithms such as TD(λ), LSTD(λ)...

Manuel Loth, Philippe Preux

ICRA
2008
IEEE

A point-based POMDP planner for target tracking

16 years 19 days ago

Download www.comp.nus.edu.sg

Target tracking has two variants that are often studied independently with different approaches: target searching requires a robot to find a target initially not visible, and ...

David Hsu, Wee Sun Lee, Nan Rong

IJCAI
2003

Approximating Optimal Policies for Agents with Limited Execution Resources

15 years 7 months ago

Download ai.stanford.edu

An agent with limited consumable execution resources needs policies that attempt to achieve good performance while respecting these limitations. Otherwise, an agent (such as a pla...

Dmitri A. Dolgov, Edmund H. Durfee

PKDD
2010
Springer

Smarter Sampling in Model-Based Bayesian Reinforcement Learning

15 years 4 months ago

Download www.cs.mcgill.ca

Bayesian reinforcement learning (RL) is aimed at making more efficient use of data samples, but typically uses significantly more computation. For discrete Markov Decis...

Pablo Samuel Castro, Doina Precup
