Search Sciweavers | Sciweavers

1376 search results - page 176 / 276

» The Localization Hypothesis and Machines

ICML
2006
IEEE

103 views, Machine Learning

Using inaccurate models in reinforcement learning

16 years 7 months ago

Download ai.stanford.edu

In the model-based policy search approach to reinforcement learning (RL), policies are found using a model (or "simulator") of the Markov decision process. However, for ...

Pieter Abbeel, Morgan Quigley, Andrew Y. Ng

ICML
2006
IEEE

148 views, Machine Learning

Bayesian pattern ranking for move prediction in the game of Go

16 years 7 months ago

Download research.microsoft.com

We investigate the problem of learning to predict moves in the board game of Go from game records of expert players. In particular, we obtain a probability distribution over legal...

David H. Stern, Ralf Herbrich, Thore Graepel

ICML
2006
IEEE

161 views, Machine Learning

Bayesian regression with input noise for high dimensional data

16 years 7 months ago

Download www-clmc.usc.edu

This paper examines high dimensional regression with noise-contaminated input and output data. Goals of such learning problems include optimal prediction with noiseless query poin...

Jo-Anne Ting, Aaron D'Souza, Stefan Schaal

ICML
2006
IEEE

128 views, Machine Learning

Discriminative cluster analysis

16 years 7 months ago

Download www.ri.cmu.edu

Clustering is one of the most widely used statistical tools for data analysis. Among all existing clustering techniques, k-means is a very popular method because of its ease of pr...

Fernando De la Torre, Takeo Kanade

ICML
2005
IEEE

94 views, Machine Learning

Multi-way distributional clustering via pairwise interactions

16 years 7 months ago

Download www.cs.umass.edu

We present a novel unsupervised learning scheme that simultaneously clusters variables of several types (e.g., documents, words and authors) based on pairwise interactions between...

Ron Bekkerman, Ran El-Yaniv, Andrew McCallum
