Search Sciweavers | Sciweavers

2990 search results - page 432 / 598

» Hidden Markov processes


CORR
2006
Springer

113 views · Education

A Unified View of TD Algorithms; Introducing Full-Gradient TD and Equi-Gradient Descent TD

15 years 6 months ago

Download hal.inria.fr

This paper addresses the issue of policy evaluation in Markov Decision Processes, using linear function approximation. It provides a unified view of algorithms such as TD(λ), LSTD(λ)...

Manuel Loth, Philippe Preux



CSL
2012
Springer

311 views · Automated Reasoning

Reinforcement learning for parameter estimation in statistical spoken dialogue systems

14 years 2 months ago

Download mi.eng.cam.ac.uk

Reinforcement techniques have been successfully used to maximise the expected cumulative reward of statistical dialogue systems. Typically, reinforcement learning is used to estim...

Filip Jurčíček, Blaise Thomson, Steve Young



CVPR
2008
IEEE

880 views · Computer Vision

Visibility in bad weather from a single image

16 years 8 months ago

Download people.cs.uu.nl

Bad weather, such as fog and haze, can significantly degrade the visibility of a scene. Optically, this is due to the substantial presence of particles in the atmosphere that abso...

Robby T. Tan



ECCV
1998
Springer

148 views · Computer Vision

A Two-Stage Probabilistic Approach for Object Recognition

16 years 8 months ago

Download www5.informatik.uni-erlangen.de

Assume that some objects are present in an image but can be seen only partially and are overlapping each other. To recognize the objects, we have to firstly separate the objects from...

Stan Z. Li, Joachim Hornegger



ICML
1995
IEEE

213 views · Machine Learning

Learning Policies for Partially Observable Environments: Scaling Up

16 years 7 months ago

Download reference.kfupm.edu.sa

Partially observable Markov decision processes (POMDPs) model decision problems in which an agent tries to maximize its reward in the face of limited and/or noisy sensor fee...

Michael L. Littman, Anthony R. Cassandra, Leslie P...

