Sciweavers

CDC
2009
IEEE
15 years 11 months ago
Event-based control using quadratic approximate value functions
In this paper we consider several problems involving control with limited actuation and sampling rates. Event-based control has emerged as an attractive approach for ad...
Randy Cogill
AAAI
2007
15 years 8 months ago
MasDISPO: A Multiagent Decision Support System for Steel Production and Control
In the majority of cases, steel production is the starting point of the supply chains it is involved in, such as those of automotive clusters or aerospace. Steel manufacturing compan...
Sven Jacobi, Esteban León-Soto, Cristiá...
NN
2010
Springer
15 years 4 months ago
Parameter-exploring policy gradients
We present a model-free reinforcement learning method for partially observable Markov decision problems. Our method estimates a likelihood gradient by sampling directly in paramet...
Frank Sehnke, Christian Osendorfer, Thomas Rü...
AIPS
2008
15 years 8 months ago
HiPPo: Hierarchical POMDPs for Planning Information Processing and Sensing Actions on a Robot
Flexible general purpose robots need to tailor their visual processing to their task, on the fly. We propose a new approach to this within a planning framework, where the goal is ...
Mohan Sridharan, Jeremy L. Wyatt, Richard Dearden
ECAI
2008
Springer
15 years 8 months ago
A probabilistic analysis of diagnosability in discrete event systems
This paper shows that we can take advantage of information about the probabilities of the occurrences of events, when this information is available, to refine the classic...
Farid Nouioua, Philippe Dague