Search Sciweavers | Sciweavers

2990 search results - page 358 / 598

» Hidden Markov processes

170

click to vote

ICIP
2002
IEEE

87views Image Processing» more ICIP 2002»

Unsupervised detection of contours using a statistical model

16 years 8 months ago

Download www.iro.umontreal.ca

In this paper, we describe an unsupervised segmentation method for contours which proves quite adapted for the images obtained by electronic acquisition. We present two statistica...

François Destrempes, Max Mignotte

claim paper

Read More »

199

click to vote

ICIP
2000
IEEE

118views Image Processing» more ICIP 2000»

Texture-Based Segmentation of Satellite Weather Imagery

16 years 8 months ago

Download www.cimms.ou.edu

Unsupervised segmentation of weather images into features that correspond to physical storms is a fundamental and difficult problem. Treating an infrared satellite image as a Mark...

V. Lakshmanan, Victor E. DeBrunner, R. Rabin

claim paper

Read More »

159

click to vote

ICML
2006
IEEE

101views Machine Learning» more ICML 2006»

Qualitative reinforcement learning

16 years 7 months ago

Download www.cs.uiuc.edu

When the transition probabilities and rewards of a Markov Decision Process are specified exactly, the problem can be solved without any interaction with the environment. When no s...

Arkady Epshteyn, Gerald DeJong

claim paper

Read More »

169

click to vote

ICML
2006
IEEE

131views Machine Learning» more ICML 2006»

PAC model-free reinforcement learning

16 years 7 months ago

Download cseweb.ucsd.edu

For a Markov Decision Process with finite state (size S) and action spaces (size A per state), we propose a new algorithm--Delayed Q-Learning. We prove it is PAC, achieving near o...

Alexander L. Strehl, Lihong Li, Eric Wiewiora, Joh...

claim paper

Read More »

162

click to vote

ICML
2006
IEEE

142views Machine Learning» more ICML 2006»

An intrinsic reward mechanism for efficient exploration

16 years 7 months ago

Download www-anw.cs.umass.edu

How should a reinforcement learning agent act if its sole purpose is to efficiently learn an optimal policy for later use? In other words, how should it explore, to be able to exp...

Özgür Simsek, Andrew G. Barto

claim paper

Read More »

« Prev « First page 358 / 598 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers