Search Sciweavers | Sciweavers

3718 search results - page 284 / 744

» On learning with dissimilarity functions

KCAP
2009
ACM

171 views, Information Technology

Interactively shaping agents via human reinforcement: the TAMER framework

16 years 1 month ago

Download userweb.cs.utexas.edu

As computational learning agents move into domains that incur real costs (e.g., autonomous driving or financial investment), it will be necessary to learn good policies without n...

W. Bradley Knox, Peter Stone

AIED
2007
Springer

138 views, Artificial Intelligence

MathGirls: Toward Developing Girls' Positive Attitude and Self-Efficacy through Pedagogical Agents

16 years 26 days ago

Download inst.usu.edu

MathGirls is a pedagogical-agent-based environment designed for high-school girls learning introductory algebra. Since females are in general more interested in interactive computi...

Yanghee Kim, Quan Wei, Beijie Xu, Youngah Ko, Vess...

NIPS
2007

164 views, Information Technology

Incremental Natural Actor-Critic Algorithms

15 years 8 months ago

Download books.nips.cc

We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...

Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...

IWLCS
2005
Springer

161 views, Machine Learning

Counter Example for Q-Bucket-Brigade Under Prediction Problem

16 years 5 days ago

Download www.cs.bham.ac.uk

Aiming to clarify the convergence or divergence conditions for Learning Classifier System (LCS), this paper explores: (1) an extreme condition where the reinforcement process of ...

Atsushi Wada, Keiki Takadama, Katsunori Shimohara

ICML
2008
IEEE

147 views, Machine Learning

Graph transduction via alternating minimization

16 years 7 months ago

Download www1.cs.columbia.edu

Graph transduction methods label input data by learning a classification function that is regularized to exhibit smoothness along a graph over labeled and unlabeled samples. In pr...

Jun Wang, Tony Jebara, Shih-Fu Chang
