Sciweavers

2327 search results - page 123 / 466
» Consistency of functional learning methods based on derivati...
Sort
View
NN
1998
Springer
15 years 6 months ago
A tennis serve and upswing learning robot based on bi-directional theory
We experimented on task-level robot learning based on bi-directional theory. The via-point representation was used for ‘learning by watching’. In our previous work, we had a r...
Hiroyuki Miyamoto, Mitsuo Kawato
COMPIMAGE
2010
Springer
16 years 1 months ago
Curvature Estimation for Discrete Curves Based on Auto-adaptive Masks of Convolution
We propose a method that we call auto-adaptive convolution which extends the classical notion of convolution in pictures analysis to function analysis on a discrete set. We define...
Christophe Fiorio, Christian Mercat, Fréd&e...
ICML
2008
IEEE
16 years 7 months ago
Sample-based learning and search with permanent and transient memories
We present a reinforcement learning architecture, Dyna-2, that encompasses both samplebased learning and sample-based search, and that generalises across states during both learni...
David Silver, Martin Müller 0003, Richard S. ...
SEAL
1998
Springer
15 years 10 months ago
Information Operator Scheduling by Genetic Algorithms
Abstract. In this paper, we discuss an approach to an operator scheduling problem in a large organization over time with the aim of maintaining service quality and reducing total l...
Takeshi Yamada, Kazuyuki Yoshimura, Ryohei Nakano
CORR
2010
Springer
105views Education» more  CORR 2010»
15 years 5 months ago
Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence
We consider model-based reinforcement learning in finite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies. Optimism is usually implemented by carryin...
Sarah Filippi, Olivier Cappé, Aurelien Gari...