Sciweavers

3643 search results - page 266 / 729
» Learning Submodular Functions
Sort
View
ICML
2005
IEEE
16 years 7 months ago
Clustering through ranking on manifolds
Clustering aims to find useful hidden structures in data. In this paper we present a new clustering algorithm that builds upon the consistency method (Zhou, et.al., 2003), a semi-...
Markus Breitenbach, Gregory Z. Grudic
KCAP
2009
ACM
16 years 1 months ago
Interactively shaping agents via human reinforcement: the TAMER framework
As computational learning agents move into domains that incur real costs (e.g., autonomous driving or financial investment), it will be necessary to learn good policies without n...
W. Bradley Knox, Peter Stone
AIED
2007
Springer
16 years 24 days ago
MathGirls: Toward Developing Girls' Positive Attitude and Self-Efficacy through Pedagogical Agents
MathGirls is a pedagogical-agent-based environment designed for high-school girls learning introductory algebra. Since females are in general more interested in interactive computi...
Yanghee Kim, Quan Wei, Beijie Xu, Youngah Ko, Vess...
179
Voted
NIPS
2007
15 years 8 months ago
Incremental Natural Actor-Critic Algorithms
We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...
Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...
IWLCS
2005
Springer
16 years 3 days ago
Counter Example for Q-Bucket-Brigade Under Prediction Problem
Aiming to clarify the convergence or divergence conditions for Learning Classifier System (LCS), this paper explores: (1) an extreme condition where the reinforcement process of ...
Atsushi Wada, Keiki Takadama, Katsunori Shimohara