Search Sciweavers | Sciweavers

2467 search results - page 291 / 494

» Finite State Machines

ICML
2005
IEEE

127 views | Machine Learning

Exploration and apprenticeship learning in reinforcement learning

16 years 7 months ago

Download ai.stanford.edu

We consider reinforcement learning in systems with unknown dynamics. Algorithms such as E3 (Kearns and Singh, 2002) learn near-optimal policies by using "exploration policies...

Pieter Abbeel, Andrew Y. Ng

ICML
2005
IEEE

134 views | Machine Learning

Computational aspects of Bayesian partition models

16 years 7 months ago

Download www.machinelearning.org

The conditional distribution of a discrete variable y, given another discrete variable x, is often specified by assigning one multinomial distribution to each state of x. The cost...

Mikko Koivisto, Kismat Sood

ICML
2004
IEEE

120 views | Machine Learning

Relational sequential inference with reliable observations

16 years 7 months ago

Download cobweb.ecn.purdue.edu

We present a trainable sequential-inference technique for processes with large state and observation spaces and relational structure. Our method assumes "reliable observation...

Alan Fern, Robert Givan

ICML
2003
IEEE

151 views | Machine Learning

Hierarchical Policy Gradient Algorithms

16 years 7 months ago

Download www.hpl.hp.com

Hierarchical reinforcement learning is a general framework which attempts to accelerate policy learning in large domains. On the other hand, policy gradient reinforcement learning...

Mohammad Ghavamzadeh, Sridhar Mahadevan

ICML
2000
IEEE

155 views | Machine Learning

Maximum Entropy Markov Models for Information Extraction and Segmentation

16 years 7 months ago

Download www.seas.upenn.edu

Hidden Markov models (HMMs) are a powerful probabilistic tool for modeling sequential data, and have been applied with success to many text-related tasks, such as part-of-speech t...

Andrew McCallum, Dayne Freitag, Fernando C. N. Per...
