Sciweavers

2990 search results - page 358 / 598
» Hidden Markov processes
Sort
View
ICIP
2002
IEEE
16 years 8 months ago
Unsupervised detection of contours using a statistical model
In this paper, we describe an unsupervised segmentation method for contours which proves quite adapted for the images obtained by electronic acquisition. We present two statistica...
François Destrempes, Max Mignotte
ICIP
2000
IEEE
16 years 8 months ago
Texture-Based Segmentation of Satellite Weather Imagery
Unsupervised segmentation of weather images into features that correspond to physical storms is a fundamental and difficult problem. Treating an infrared satellite image as a Mark...
V. Lakshmanan, Victor E. DeBrunner, R. Rabin
ICML
2006
IEEE
16 years 7 months ago
Qualitative reinforcement learning
When the transition probabilities and rewards of a Markov Decision Process are specified exactly, the problem can be solved without any interaction with the environment. When no s...
Arkady Epshteyn, Gerald DeJong
ICML
2006
IEEE
16 years 7 months ago
PAC model-free reinforcement learning
For a Markov Decision Process with finite state (size S) and action spaces (size A per state), we propose a new algorithm--Delayed Q-Learning. We prove it is PAC, achieving near o...
Alexander L. Strehl, Lihong Li, Eric Wiewiora, Joh...
ICML
2006
IEEE
16 years 7 months ago
An intrinsic reward mechanism for efficient exploration
How should a reinforcement learning agent act if its sole purpose is to efficiently learn an optimal policy for later use? In other words, how should it explore, to be able to exp...
Özgür Simsek, Andrew G. Barto