Search Sciweavers | Sciweavers

829 search results - page 53 / 166

» A time aggregation approach to Markov decision processes

134

click to vote

ICANN
2007
Springer

95views Neural Networks» more ICANN 2007»

Solving Deep Memory POMDPs with Recurrent Policy Gradients

16 years 9 days ago

Download www.idsia.ch

Abstract. This paper presents Recurrent Policy Gradients, a modelfree reinforcement learning (RL) method creating limited-memory stochastic policies for partially observable Markov...

Daan Wierstra, Alexander Förster, Jan Peters,...

claim paper

Read More »

182

click to vote

ICIP
2002
IEEE

153views Image Processing» more ICIP 2002»

Extract highlights from baseball game video with hidden Markov models

16 years 7 months ago

Download www.ri.cmu.edu

In this paper, we describe a statistical method to detect highlights in a baseball game video. The input video is first segmented into scene shots, within which the camera motion ...

Peng Chang, Mei Han, Yihong Gong

claim paper

Read More »

189

click to vote

AAAI
2012

191views Intelligent Agents» more AAAI 2012»

Tree-Based Solution Methods for Multiagent POMDPs with Delayed Communication

13 years 8 months ago

Download people.csail.mit.edu

Planning under uncertainty is an important and challenging problem in multiagent systems. Multiagent Partially Observable Markov Decision Processes (MPOMDPs) provide a powerful fr...

Frans Adriaan Oliehoek, Matthijs T. J. Spaan

claim paper

Read More »

145

click to vote

ECML
2006
Springer

88views Machine Learning» more ECML 2006»

Reinforcement Learning for MDPs with Constraints

15 years 8 months ago

Download www.peter-geibel.de

In this article, I will consider Markov Decision Processes with two criteria, each defined as the expected value of an infinite horizon cumulative return. The second criterion is e...

Peter Geibel

claim paper

Read More »

153

click to vote

NETWORKING
2000

88views Computer Networks» more NETWORKING 2000»

Fairness and Aggregation: A Primal Decomposition Study

15 years 7 months ago

Download www.ece.uwaterloo.ca

Abstract. We examine the fair allocation of capacity to a large population of best-effort connections in a typical multiple access communication system supporting some bandwidth on...

André Girard, Catherine Rosenberg, Mohammed...

claim paper

Read More »

« Prev « First page 53 / 166 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers