Search Sciweavers | Sciweavers

2990 search results - page 477 / 598

» Hidden Markov processes

NIPS
2004

224 views, Information Technology

Approximately Efficient Online Mechanism Design

15 years 7 months ago

Download www.cs.cmu.edu

Online mechanism design (OMD) addresses the problem of sequential decision making in a stochastic environment with multiple self-interested agents. The goal in OMD is to make valu...

David C. Parkes, Satinder P. Singh, Dimah Yanovsky

NIPS
2004

115 views, Information Technology

Modelling Uncertainty in the Game of Go

15 years 7 months ago

Download books.nips.cc

Go is an ancient oriental game whose complexity has defeated attempts to automate it. We suggest using probability in a Bayesian sense to model the uncertainty arising from the va...

David H. Stern, Thore Graepel, David J. C. MacKay

IJCAI
2003

123 views, Artificial Intelligence

Automated Generation of Understandable Contingency Plans

15 years 7 months ago

Download anytime.cs.umass.edu

Markov decision processes (MDPs) and contingency planning (CP) are two widely used approaches to planning under uncertainty. MDPs are attractive because the model is extremely gen...

Max Horstmann, Shlomo Zilberstein

IJCAI
2003

89 views, Artificial Intelligence

Modular self-organization for a long-living autonomous agent

15 years 7 months ago

Download dli.iiit.ac.in

The aim of this paper is to provide a sound framework for addressing a difficult problem: the automatic construction of an autonomous agent's modular architecture. We briefly...

Bruno Scherrer

IJCAI
2003

130 views, Artificial Intelligence

Multiple-Goal Reinforcement Learning with Modular Sarsa(0)

15 years 7 months ago

Download www.cc.gatech.edu

We present a new algorithm, GM-Sarsa(0), for finding approximate solutions to multiple-goal reinforcement learning problems that are modeled as composite Markov decision processe...

Nathan Sprague, Dana H. Ballard
