Search Sciweavers | Sciweavers

2005 search results - page 159 / 401

» Decisive Markov Chains

177

click to vote

ICANN
2001
Springer

123views Neural Networks» more ICANN 2001»

Market-Based Reinforcement Learning in Partially Observable Worlds

15 years 11 months ago

Download www.hutter1.net

Unlike traditional reinforcement learning (RL), market-based RL is in principle applicable to worlds described by partially observable Markov Decision Processes (POMDPs), where an ...

Ivo Kwee, Marcus Hutter, Jürgen Schmidhuber

claim paper

Read More »

140

click to vote

COLT
2000
Springer

87views Machine Learning» more COLT 2000»

Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

15 years 11 months ago

Download www.cs.iastate.edu

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process ( ¢¡¤£¦¥§ ), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter

claim paper

Read More »

174

click to vote

AAAI
2006

94views Intelligent Agents» more AAAI 2006»

Factored MDP Elicitation and Plan Display

15 years 8 months ago

Download www.aaai.org

The software suite we will demonstrate at AAAI '06 was designed around planning with factored Markov decision processes (MDPs). It is a user-friendly suite that facilitates d...

Krol Kevin Mathias, Casey Lengacher, Derek William...

claim paper

Read More »

142

click to vote

AIPS
2006

161views Artificial Intelligence» more AIPS 2006»

Automated Planning Using Quantum Computation

15 years 8 months ago

Download www.aaai.org

This paper presents an adaptation of the standard quantum search technique to enable application within Dynamic Programming, in order to optimise a Markov Decision Process. This i...

Sanjeev Naguleswaran, Langford B. White, I. Fuss

claim paper

Read More »

154

click to vote

AIPS
2003

149views Artificial Intelligence» more AIPS 2003»

Synthesis of Hierarchical Finite-State Controllers for POMDPs

15 years 8 months ago

Download www.aaai.org

We develop a hierarchical approach to planning for partially observable Markov decision processes (POMDPs) in which a policy is represented as a hierarchical ﬁnite-state control...

Eric A. Hansen, Rong Zhou

claim paper

Read More »

« Prev « First page 159 / 401 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers