Search Sciweavers | Sciweavers

515 search results - page 61 / 103

» Approximating Markov Processes by Averaging

181

click to vote

ATAL
2009
Springer

146views Intelligent Agents» more ATAL 2009»

Transfer via soft homomorphisms

16 years 21 days ago

Download www.eecs.umich.edu

The ﬁeld of transfer learning aims to speed up learning across multiple related tasks by transferring knowledge between source and target tasks. Past work has shown that when th...

Jonathan Sorg, Satinder Singh

claim paper

Read More »

177

click to vote

IAT
2005
IEEE

132views Intelligent Agents» more IAT 2005»

Decomposing Large-Scale POMDP Via Belief State Analysis

15 years 11 months ago

Download www.comp.hkbu.edu.hk

Partially observable Markov decision process (POMDP) is commonly used to model a stochastic environment with unobservable states for supporting optimal decision making. Computing ...

Xin Li, William K. Cheung, Jiming Liu

claim paper

Read More »

154

click to vote

AIPS
2008

111views Artificial Intelligence» more AIPS 2008»

Multiagent Planning Under Uncertainty with Stochastic Communication Delays

15 years 8 months ago

Download www.aaai.org

We consider the problem of cooperative multiagent planning under uncertainty, formalized as a decentralized partially observable Markov decision process (Dec-POMDP). Unfortunately...

Matthijs T. J. Spaan, Frans A. Oliehoek, Nikos A. ...

claim paper

Read More »

183

click to vote

ATAL
2008
Springer

138views Intelligent Agents» more ATAL 2008»

Reinforcement learning for DEC-MDPs with changing action sets and partially ordered dependencies

15 years 8 months ago

Download ml.informatik.uni-freiburg.de

Decentralized Markov decision processes are frequently used to model cooperative multi-agent systems. In this paper, we identify a subclass of general DEC-MDPs that features regul...

Thomas Gabel, Martin A. Riedmiller

claim paper

Read More »

149

click to vote

ATAL
2010
Springer

141views Intelligent Agents» more ATAL 2010»

Risk-sensitive planning in partially observable environments

15 years 7 months ago

Download www.aamas-conference.org

Partially Observable Markov Decision Process (POMDP) is a popular framework for planning under uncertainty in partially observable domains. Yet, the POMDP model is riskneutral in ...

Janusz Marecki, Pradeep Varakantham

claim paper

Read More »

« Prev « First page 61 / 103 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers