
ICML 2005, IEEE


Exploration and apprenticeship learning in reinforcement learning

16 years 7 months ago

We consider reinforcement learning in systems with unknown dynamics. Algorithms such as E3 (Kearns and Singh, 2002) learn near-optimal policies by using "exploration policies...

Pieter Abbeel, Andrew Y. Ng


ECML 1993, Springer


Integrated Learning Architectures

15 years 11 months ago

Download www.idi.ntnu.no

Research in systems where learning is integrated with other components like problem solving, vision, or natural language is becoming an important topic for Machine Learning. Situatio...

Enric Plaza, Agnar Aamodt, Ashwin Ram, Walter Van ...


COLT 1994, Springer


Rigorous Learning Curve Bounds from Statistical Mechanics

15 years 11 months ago

Download www.cs.huji.ac.il

In this paper we introduce and investigate a mathematically rigorous theory of learning curves that is based on ideas from statistical mechanics. The advantage of our theory over ...

David Haussler, H. Sebastian Seung, Michael J. Kea...


COLT 2006, Springer


Stable Transductive Learning

15 years 10 months ago

Download eprints.pascal-network.org

We develop a new error bound for transductive learning algorithms. The slack term in the new bound is a function of a relaxed notion of transductive stability, which meas...

Ran El-Yaniv, Dmitry Pechyony


ECML 2006, Springer


Skill Acquisition Via Transfer Learning and Advice Taking

15 years 10 months ago

Download pages.cs.wisc.edu

We describe a reinforcement learning system that transfers skills from a previously learned source task to a related target task. The system uses inductive logic programming to ana...

Lisa Torrey, Jude W. Shavlik, Trevor Walker, Richa...
