Sciweavers

1227 search results - page 141 / 246
» Learning Rates for Q-Learning
ML
2007
ACM
15 years 5 months ago
Density estimation with stagewise optimization of the empirical risk
We consider multivariate density estimation with identically distributed observations. We study a density estimator which is a convex combination of functions in a dictionary and ...
Jussi Klemelä
ML
2007
ACM
15 years 5 months ago
Annealing stochastic approximation Monte Carlo algorithm for neural network training
We propose a general-purpose stochastic optimization algorithm, the so-called annealing stochastic approximation Monte Carlo (ASAMC) algorithm, for neural network training. ASAMC c...
Faming Liang
COLT
2010
Springer
15 years 4 months ago
Open Loop Optimistic Planning
We consider the problem of planning in a stochastic and discounted environment with a limited numerical budget. More precisely, we investigate strategies exploring the set of poss...
Sébastien Bubeck, Rémi Munos
ICDE
2008
IEEE
16 years 7 months ago
Explaining and Reformulating Authority Flow Queries
Authority flow is an effective ranking mechanism for answering queries on a broad class of data. Systems have been developed to apply this principle on the Web (PageRank and topic ...
Ramakrishna Varadarajan, Vagelis Hristidis, Louiqa...
CASON
2009
IEEE
16 years 1 month ago
Social Network - An Autonomous System Designed for Radio Recommendation
This paper describes the functions of a system proposed for music tube recommendation from a social network database. Such a system enables the automatic collection, evaluation...
Grzegorz Dziczkowski, Lamine Bougueroua, Katarzyna...