Search Sciweavers | Sciweavers

1619 search results - page 177 / 324

» Structure in the Space of Value Functions

150

click to vote

WADS
2007
Springer

115views Algorithms» more WADS 2007»

Priority Queues Resilient to Memory Faults

16 years 17 days ago

Download www.cs.duke.edu

In the faulty-memory RAM model, the content of memory cells can get corrupted at any time during the execution of an algorithm, and a constant number of uncorruptible registers are...

Allan Grønlund Jørgensen, Gabriel Mo...

claim paper

Read More »

174

click to vote

NIPS
2007

166views Information Technology» more NIPS 2007»

Spatial Latent Dirichlet Allocation

15 years 8 months ago

Download books.nips.cc

In recent years, the language model Latent Dirichlet Allocation (LDA), which clusters co-occurring words into topics, has been widely applied in the computer vision ﬁeld. Howeve...

Xiaogang Wang, Eric Grimson

claim paper

Read More »

136

click to vote

ICPR
2008
IEEE

152views Computer Vision» more ICPR 2008»

Component-wise parameter smoothing for learning mixture models

16 years 7 months ago

Download www.cs.wayne.edu

In this paper, we propose a novel component-wise smoothing algorithm that constructs a hierarchy (or family) of smoothened log-likelihood surfaces. Our approach first smoothens th...

Bala Rajaratnam, Chandan K. Reddy

claim paper

Read More »

135

click to vote

ICML
2006
IEEE

131views Machine Learning» more ICML 2006»

Relational temporal difference learning

16 years 7 months ago

Download cll.stanford.edu

We introduce relational temporal difference learning as an effective approach to solving multi-agent Markov decision problems with large state spaces. Our algorithm uses temporal ...

Nima Asgharbeygi, David J. Stracuzzi, Pat Langley

claim paper

Read More »

183

click to vote

ICML
2006
IEEE

143views Machine Learning» more ICML 2006»

Fast direct policy evaluation using multiscale analysis of Markov diffusion processes

16 years 7 months ago

Download www.cs.umass.edu

Policy evaluation is a critical step in the approximate solution of large Markov decision processes (MDPs), typically requiring O(|S|3 ) to directly solve the Bellman system of |S...

Mauro Maggioni, Sridhar Mahadevan

claim paper

Read More »

« Prev « First page 177 / 324 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers