Search Sciweavers | Sciweavers

167

WSC
2008

149views Modeling And Simulation» more WSC 2008»

A Pi-calculus formalism for discrete event simulation

15 years 9 months ago

This paper presents PiDES, a formalism for discrete event simulation based on Pi-calculus. PiDES provides a rigorous semantics of behavior modeling and coordination for simulation...

Jianrui Wang, Richard A. Wysk

claim paper

Read More »

199

click to vote

ATAL
2008
Springer

138views Intelligent Agents» more ATAL 2008»

Reinforcement learning for DEC-MDPs with changing action sets and partially ordered dependencies

15 years 8 months ago

Download ml.informatik.uni-freiburg.de

Decentralized Markov decision processes are frequently used to model cooperative multi-agent systems. In this paper, we identify a subclass of general DEC-MDPs that features regul...

Thomas Gabel, Martin A. Riedmiller

claim paper

Read More »

164

click to vote

AAAI
2006

108views Intelligent Agents» more AAAI 2006»

Using Homomorphisms to Transfer Options across Continuous Reinforcement Learning Domains

15 years 8 months ago

Download www.eecs.umich.edu

We examine the problem of Transfer in Reinforcement Learning and present a method to utilize knowledge acquired in one Markov Decision Process (MDP) to bootstrap learning in a mor...

Vishal Soni, Satinder P. Singh

claim paper

Read More »

171

click to vote

CAINE
2003

121views Computer Science» more CAINE 2003»

POMDP Planning for High Level UAV Decisions: Search vs. Strike

15 years 8 months ago

Download www.cs.ndsu.nodak.edu

The Partially Observable Markov Decision Process (POMDP) model is explored for high level decision making for Unmanned Air Vehicles (UAVs). The type of UAV modeled is a flying mun...

Doug Schesvold, Jingpeng Tang, Benzir Md Ahmed, Ka...

claim paper

Read More »

182

click to vote

IJCAI
2003

142views Artificial Intelligence» more IJCAI 2003»

Taming Decentralized POMDPs: Towards Efficient Policy Computation for Multiagent Settings

15 years 8 months ago

Download dli.iiit.ac.in

The problem of deriving joint policies for a group of agents that maximize some joint reward function can be modeled as a decentralized partially observable Markov decision proces...

Ranjit Nair, Milind Tambe, Makoto Yokoo, David V. ...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers