Search Sciweavers | Sciweavers

3643 search results - page 266 / 729

» Learning Submodular Functions

177

click to vote

ICML
2005
IEEE

162views Machine Learning» more ICML 2005»

Clustering through ranking on manifolds

16 years 7 months ago

Download cervisia.org

Clustering aims to find useful hidden structures in data. In this paper we present a new clustering algorithm that builds upon the consistency method (Zhou, et.al., 2003), a semi-...

Markus Breitenbach, Gregory Z. Grudic

claim paper

Read More »

174

click to vote

KCAP
2009
ACM

171views Information Technology» more KCAP 2009»

Interactively shaping agents via human reinforcement: the TAMER framework

16 years 1 months ago

Download userweb.cs.utexas.edu

As computational learning agents move into domains that incur real costs (e.g., autonomous driving or ﬁnancial investment), it will be necessary to learn good policies without n...

W. Bradley Knox, Peter Stone

claim paper

Read More »

174

click to vote

AIED
2007
Springer

138views Artificial Intelligence» more AIED 2007»

MathGirls: Toward Developing Girls' Positive Attitude and Self-Efficacy through Pedagogical Agents

16 years 24 days ago

Download inst.usu.edu

MathGirls is a pedagogical-agent-based environment designed for high-school girls learning introductory algebra. Since females are in general more interested in interactive computi...

Yanghee Kim, Quan Wei, Beijie Xu, Youngah Ko, Vess...

claim paper

Read More »

179

Voted

NIPS
2007

164views Information Technology» more NIPS 2007»

Incremental Natural Actor-Critic Algorithms

15 years 8 months ago

Download books.nips.cc

We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...

Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...

claim paper

Read More »

215

click to vote

IWLCS
2005
Springer

161views Machine Learning» more IWLCS 2005»

Counter Example for Q-Bucket-Brigade Under Prediction Problem

16 years 3 days ago

Download www.cs.bham.ac.uk

Aiming to clarify the convergence or divergence conditions for Learning Classiﬁer System (LCS), this paper explores: (1) an extreme condition where the reinforcement process of ...

Atsushi Wada, Keiki Takadama, Katsunori Shimohara

claim paper

Read More »

« Prev « First page 266 / 729 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers