Sciweavers

2990 search results - page 432 / 598
» Hidden Markov processes
CORR
2006
Springer
15 years 6 months ago
A Unified View of TD Algorithms; Introducing Full-Gradient TD and Equi-Gradient Descent TD
This paper addresses the issue of policy evaluation in Markov Decision Processes, using linear function approximation. It provides a unified view of algorithms such as TD(λ), LSTD(λ)...
Manuel Loth, Philippe Preux
CSL
2012
Springer
14 years 2 months ago
Reinforcement learning for parameter estimation in statistical spoken dialogue systems
Reinforcement techniques have been successfully used to maximise the expected cumulative reward of statistical dialogue systems. Typically, reinforcement learning is used to estim...
Filip Jurčíček, Blaise Thomson, Steve Young
CVPR
2008
IEEE
16 years 8 months ago
Visibility in bad weather from a single image
Bad weather, such as fog and haze, can significantly degrade the visibility of a scene. Optically, this is due to the substantial presence of particles in the atmosphere that abso...
Robby T. Tan
ECCV
1998
Springer
16 years 8 months ago
A Two-Stage Probabilistic Approach for Object Recognition
Assume that some objects are present in an image but can be seen only partially and are overlapping each other. To recognize the objects, we have to firstly separate the objects from...
Stan Z. Li, Joachim Hornegger
ICML
1995
IEEE
16 years 7 months ago
Learning Policies for Partially Observable Environments: Scaling Up
Partially observable Markov decision processes (pomdp's) model decision problems in which an agent tries to maximize its reward in the face of limited and/or noisy sensor fee...
Michael L. Littman, Anthony R. Cassandra, Leslie P...