Sciweavers

» The EvA2 Optimization Framework
NIPS
2007
15 years 8 months ago
Bundle Methods for Machine Learning
We present a globally convergent method for regularized risk minimization problems. Our method applies to Support Vector estimation, regression, Gaussian Processes, and any other ...
Alex J. Smola, S. V. N. Vishwanathan, Quoc V. Le
UAI
2008
15 years 8 months ago
Model-Based Bayesian Reinforcement Learning in Large Structured Domains
Model-based Bayesian reinforcement learning has generated significant interest in the AI community as it provides an elegant solution to the optimal exploration-exploitation trade...
Stéphane Ross, Joelle Pineau
AIPS
2006
15 years 8 months ago
Solving Factored MDPs with Exponential-Family Transition Models
Markov decision processes (MDPs) with discrete and continuous state and action components can be solved efficiently by hybrid approximate linear programming (HALP). The main idea ...
Branislav Kveton, Milos Hauskrecht
NIPS
2004
15 years 8 months ago
Responding to Modalities with Different Latencies
Motor control depends on sensory feedback in multiple modalities with different latencies. In this paper we consider within the framework of reinforcement learning how different s...
Fredrik Bissmarck, Hiroyuki Nakahara, Kenji Doya, ...
NIPS
2001
15 years 8 months ago
Boosting and Maximum Likelihood for Exponential Models
We derive an equivalence between AdaBoost and the dual of a convex optimization problem, showing that the only difference between minimizing the exponential loss used by AdaBoost ...
Guy Lebanon, John D. Lafferty