Search Sciweavers | Sciweavers

2327 search results - page 123 / 466

» Consistency of functional learning methods based on derivati...

157

click to vote

NN
1998
Springer

67views Neural Networks» more NN 1998»

A tennis serve and upswing learning robot based on bi-directional theory

15 years 6 months ago

Download www.cns.atr.jp

We experimented on task-level robot learning based on bi-directional theory. The via-point representation was used for ‘learning by watching’. In our previous work, we had a r...

Hiroyuki Miyamoto, Mitsuo Kawato

claim paper

Read More »

187

click to vote

COMPIMAGE
2010
Springer

213views Solid Modeling» more COMPIMAGE 2010»

Curvature Estimation for Discrete Curves Based on Auto-adaptive Masks of Convolution

16 years 1 months ago

Download hal-lirmm.ccsd.cnrs.fr

We propose a method that we call auto-adaptive convolution which extends the classical notion of convolution in pictures analysis to function analysis on a discrete set. We deﬁne...

Christophe Fiorio, Christian Mercat, Fréd&e...

claim paper

Read More »

179

click to vote

ICML
2008
IEEE

117views Machine Learning» more ICML 2008»

Sample-based learning and search with permanent and transient memories

16 years 7 months ago

Download www.cs.ualberta.ca

We present a reinforcement learning architecture, Dyna-2, that encompasses both samplebased learning and sample-based search, and that generalises across states during both learni...

David Silver, Martin Müller 0003, Richard S. ...

claim paper

Read More »

128

click to vote

SEAL
1998
Springer

74views Machine Learning» more SEAL 1998»

Information Operator Scheduling by Genetic Algorithms

15 years 10 months ago

Download www.kecl.ntt.co.jp

Abstract. In this paper, we discuss an approach to an operator scheduling problem in a large organization over time with the aim of maintaining service quality and reducing total l...

Takeshi Yamada, Kazuyuki Yoshimura, Ryohei Nakano

claim paper

Read More »

171

click to vote

CORR
2010
Springer

105views Education» more CORR 2010»

Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence

15 years 5 months ago

Download hal.archives-ouvertes.fr

We consider model-based reinforcement learning in ﬁnite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies. Optimism is usually implemented by carryin...

Sarah Filippi, Olivier Cappé, Aurelien Gari...

claim paper

Read More »

« Prev « First page 123 / 466 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers