Sciweavers

3381 search results - page 396 / 677
» LEO - DB2's LEarning Optimizer
Sort
View
ALT
2007
Springer
16 years 26 days ago
Online Regression Competitive with Changing Predictors
This paper deals with the problem of making predictions in the online mode of learning where the dependence of the outcome yt on the signal xt can change with time. The Aggregating...
Steven Busuttil, Yuri Kalnishkan
ATAL
2007
Springer
16 years 26 days ago
Theoretical advantages of lenient Q-learners: an evolutionary game theoretic perspective
This paper presents the dynamics of multiple reinforcement learning agents from an Evolutionary Game Theoretic (EGT) perspective. We provide a Replicator Dynamics model for tradit...
Liviu Panait, Karl Tuyls
DAGM
2007
Springer
16 years 25 days ago
The Minimum Volume Ellipsoid Metric
We propose an unsupervised “local learning” algorithm for learning a metric in the input space. Geometrically, for a given query point, the algorithm finds the minimum volume ...
Karim T. Abou-Moustafa, Frank P. Ferrie
GECCO
2007
Springer
192views Optimization» more  GECCO 2007»
16 years 25 days ago
Estimation of fitness landscape contours in EAs
Evolutionary algorithms applied in real domain should profit from information about the local fitness function curvature. This paper presents an initial study of an evolutionary...
Petr Posík, Vojtech Franc
ECML
2005
Springer
16 years 6 days ago
Multi-armed Bandit Algorithms and Empirical Evaluation
The multi-armed bandit problem for a gambler is to decide which arm of a K-slot machine to pull to maximize his total reward in a series of trials. Many real-world learning and opt...
Joannès Vermorel, Mehryar Mohri