Search Sciweavers | Sciweavers

683 search results - page 70 / 137

» Coarticulation in Markov Decision Processes

151

click to vote

AIPS
2007

80views Artificial Intelligence» more AIPS 2007»

Prioritizing Bellman Backups without a Priority Queue

15 years 8 months ago

Download www.cs.washington.edu

Several researchers have shown that the efﬁciency of value iteration, a dynamic programming algorithm for Markov decision processes, can be improved by prioritizing the order of...

Peng Dai, Eric A. Hansen

claim paper

Read More »

155

click to vote

AIPS
2008

111views Artificial Intelligence» more AIPS 2008»

Multiagent Planning Under Uncertainty with Stochastic Communication Delays

15 years 8 months ago

Download www.aaai.org

We consider the problem of cooperative multiagent planning under uncertainty, formalized as a decentralized partially observable Markov decision process (Dec-POMDP). Unfortunately...

Matthijs T. J. Spaan, Frans A. Oliehoek, Nikos A. ...

claim paper

Read More »

185

click to vote

ATAL
2008
Springer

138views Intelligent Agents» more ATAL 2008»

Reinforcement learning for DEC-MDPs with changing action sets and partially ordered dependencies

15 years 8 months ago

Download ml.informatik.uni-freiburg.de

Decentralized Markov decision processes are frequently used to model cooperative multi-agent systems. In this paper, we identify a subclass of general DEC-MDPs that features regul...

Thomas Gabel, Martin A. Riedmiller

claim paper

Read More »

159

click to vote

AAAI
2006

108views Intelligent Agents» more AAAI 2006»

Using Homomorphisms to Transfer Options across Continuous Reinforcement Learning Domains

15 years 7 months ago

Download www.eecs.umich.edu

We examine the problem of Transfer in Reinforcement Learning and present a method to utilize knowledge acquired in one Markov Decision Process (MDP) to bootstrap learning in a mor...

Vishal Soni, Satinder P. Singh

claim paper

Read More »

166

click to vote

IJCAI
2003

142views Artificial Intelligence» more IJCAI 2003»

Taming Decentralized POMDPs: Towards Efficient Policy Computation for Multiagent Settings

15 years 7 months ago

Download dli.iiit.ac.in

The problem of deriving joint policies for a group of agents that maximize some joint reward function can be modeled as a decentralized partially observable Markov decision proces...

Ranjit Nair, Milind Tambe, Makoto Yokoo, David V. ...

claim paper

Read More »

« Prev « First page 70 / 137 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers