Search Sciweavers | Sciweavers

2990 search results - page 376 / 598

» Hidden Markov processes

194

click to vote

IJCAI
2007

170views Artificial Intelligence» more IJCAI 2007»

First Order Decision Diagrams for Relational MDPs

15 years 8 months ago

Download www.cs.tufts.edu

Dynamic programming algorithms provide a basic tool identifying optimal solutions in Markov Decision Processes (MDP). The paper develops a representation for decision diagrams sui...

Chenggang Wang, Saket Joshi, Roni Khardon

claim paper

Read More »

171

click to vote

AAAI
2004

103views Intelligent Agents» more AAAI 2004»

Stochastic Local Search for POMDP Controllers

15 years 8 months ago

Download www.cs.utoronto.ca

The search for finite-state controllers for partially observable Markov decision processes (POMDPs) is often based on approaches like gradient ascent, attractive because of their ...

Darius Braziunas, Craig Boutilier

claim paper

Read More »

159

click to vote

FLAIRS
2006

101views Artificial Intelligence» more FLAIRS 2006»

Stochastic Deliberation Scheduling using GSMDPs

15 years 8 months ago

Download www.aaai.org

We propose a new decision-theoretic approach for solving execution-time deliberation scheduling problems using recent advances in Generalized Semi-Markov Decision Processes (GSMDP...

Kurt D. Krebsbach

claim paper

Read More »

177

click to vote

IJCAI
2001

185views Artificial Intelligence» more IJCAI 2001»

Symbolic Dynamic Programming for First-Order MDPs

15 years 8 months ago

Download www.cs.toronto.edu

We present a dynamic programming approach for the solution of first-order Markov decisions processes. This technique uses an MDP whose dynamics is represented in a variant of the ...

Craig Boutilier, Raymond Reiter, Bob Price

claim paper

Read More »

173

click to vote

NIPS
2000

127views Information Technology» more NIPS 2000»

Using Free Energies to Represent Q-values in a Multiagent Reinforcement Learning Task

15 years 8 months ago

Download members.chello.at

The problem of reinforcement learning in large factored Markov decision processes is explored. The Q-value of a state-action pair is approximated by the free energy of a product o...

Brian Sallans, Geoffrey E. Hinton

claim paper

Read More »

« Prev « First page 376 / 598 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers