Search Sciweavers | Sciweavers

4255 search results - page 381 / 851

» On Learning Boolean Functions

197

click to vote

NIPS
2008

165views Information Technology» more NIPS 2008»

Regularized Policy Iteration

15 years 8 months ago

Download webdocs.cs.ualberta.ca

In this paper we consider approximate policy-iteration-based reinforcement learning algorithms. In order to implement a flexible function approximation scheme we propose the use o...

Amir Massoud Farahmand, Mohammad Ghavamzadeh, Csab...

claim paper

Read More »

140

click to vote

NIPS
2007

127views Information Technology» more NIPS 2007»

On higher-order perceptron algorithms

15 years 8 months ago

Download books.nips.cc

A new algorithm for on-line learning linear-threshold functions is proposed which efﬁciently combines second-order statistics about the data with the ”logarithmic behavior” ...

Claudio Gentile, Fabio Vitale, Cristian Brotto

claim paper

Read More »

186

click to vote

GECCO
2008
Springer

170views Optimization» more GECCO 2008»

Evolving prediction weights using evolution strategy

15 years 7 months ago

Download www.cs.bham.ac.uk

The evolution strategy is one of the strongest evolutionary algorithms for optimizing real-value vectors. In this paper, we study how to use it for the evolution of prediction wei...

Trung Hau Tran, Cédric Sanza, Yves Duthen

claim paper

Read More »

180

click to vote

ICML
2005
IEEE

162views Machine Learning» more ICML 2005»

Clustering through ranking on manifolds

16 years 7 months ago

Download cervisia.org

Clustering aims to find useful hidden structures in data. In this paper we present a new clustering algorithm that builds upon the consistency method (Zhou, et.al., 2003), a semi-...

Markus Breitenbach, Gregory Z. Grudic

claim paper

Read More »

177

click to vote

KCAP
2009
ACM

171views Information Technology» more KCAP 2009»

Interactively shaping agents via human reinforcement: the TAMER framework

16 years 1 months ago

Download userweb.cs.utexas.edu

As computational learning agents move into domains that incur real costs (e.g., autonomous driving or ﬁnancial investment), it will be necessary to learn good policies without n...

W. Bradley Knox, Peter Stone

claim paper

Read More »

« Prev « First page 381 / 851 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers