Search Sciweavers | Sciweavers

7351 search results - page 1030 / 1471

» Machine Learning

196

click to vote

COLT
2010
Springer

149views Machine Learning» more COLT 2010»

Open Loop Optimistic Planning

15 years 4 months ago

Download www.colt2010.org

We consider the problem of planning in a stochastic and discounted environment with a limited numerical budget. More precisely, we investigate strategies exploring the set of poss...

Sébastien Bubeck, Rémi Munos

claim paper

Read More »

191

click to vote

COLT
2010
Springer

157views Machine Learning» more COLT 2010»

Efficient Classification for Metric Data

15 years 4 months ago

Download www.wisdom.weizmann.ac.il

Recent advances in large-margin classification of data residing in general metric spaces (rather than Hilbert spaces) enable classification under various natural metrics, such as ...

Lee-Ad Gottlieb, Leonid Kontorovich, Robert Krauth...

claim paper

Read More »

225

click to vote

COLT
2010
Springer

201views Machine Learning» more COLT 2010»

Forest Density Estimation

15 years 4 months ago

Download www.cs.cmu.edu

We study graph estimation and density estimation in high dimensions, using a family of density estimators based on forest structured undirected graphical models. For density estim...

Anupam Gupta, John D. Lafferty, Han Liu, Larry A. ...

claim paper

Read More »

231

click to vote

COLT
2010
Springer

217views Machine Learning» more COLT 2010»

Optimal Algorithms for Online Convex Optimization with Multi-Point Bandit Feedback

15 years 4 months ago

Download www.eecs.berkeley.edu

Bandit convex optimization is a special case of online convex optimization with partial information. In this setting, a player attempts to minimize a sequence of adversarially gen...

Alekh Agarwal, Ofer Dekel, Lin Xiao

claim paper

Read More »

200

click to vote

COLT
2010
Springer

191views Machine Learning» more COLT 2010»

Best Arm Identification in Multi-Armed Bandits

15 years 4 months ago

Download www.di.ens.fr

We consider the problem of finding the best arm in a stochastic multi-armed bandit game. The regret of a forecaster is here defined by the gap between the mean reward of the optim...

Jean-Yves Audibert, Sébastien Bubeck, R&eac...

claim paper

Read More »

« Prev « First page 1030 / 1471 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers