Sciweavers

7853 search results - page 341 / 1571
Learning from Each Other
ICRA
2010
IEEE
149 views, Robotics
15 years 5 months ago
A simple learning strategy for high-speed quadrocopter multi-flips
We describe a simple and intuitive policy gradient method for improving parametrized quadrocopter multi-flips by combining iterative experiments with information from a first...
Sergei Lupashin, Angela Schöllig, Michael She...
SODA
2010
ACM
371 views, Algorithms
16 years 4 months ago
Online Learning with Queries
The online learning problem requires a player to iteratively choose an action in an unknown and changing environment. In the standard setting of this problem, the player has to ch...
Chao-Kai Chiang, Chi-Jen Lu
ICML
2004
IEEE
16 years 7 months ago
A multiplicative up-propagation algorithm
We present a generalization of the nonnegative matrix factorization (NMF), where a multilayer generative network with nonnegative weights is used to approximate the observed nonne...
Jong-Hoon Ahn, Seungjin Choi, Jong-Hoon Oh
IJCNN
2007
IEEE
16 years 1 month ago
Multi-Stage Optimal Component Analysis
Optimal component analysis (OCA) uses a stochastic gradient optimization process to find optimal representations for general criteria and shows good performance in object reco...
Yiming Wu, Xiuwen Liu, Washington Mio
ALT
1994
Springer
15 years 11 months ago
Program Synthesis in the Presence of Infinite Number of Inaccuracies
Most studies modeling inaccurate data in Gold style learning consider cases in which the number of inaccuracies is finite. The present paper argues that this approach is not reaso...
Sanjay Jain