Sciweavers

3668 search results - page 454 / 734
» Margin Distribution and Learning
Sort
View
NIPS
2001
15 years 8 months ago
Model-Free Least-Squares Policy Iteration
We propose a new approach to reinforcement learning which combines least squares function approximation with policy iteration. Our method is model-free and completely off policy. ...
Michail G. Lagoudakis, Ronald Parr
JNCA
2006
114views more  JNCA 2006»
15 years 6 months ago
An evolutionary approach to prototyping pedagogical agents: from simulation to integrated system
We have developed and integrated software agents with two educational groupware systems (TeamWave Workplace and FLE), using evolutionary prototyping and empiricalbased design as d...
Anders I. Mørch, Jan A. Dolonen, Jan Eirik ...
JDWM
2007
86views more  JDWM 2007»
15 years 6 months ago
Predicting Future Customers via Ensembling Gradually Expanded Trees
Our LAMDAer team has won the PAKDD'06 Data Mining Competition (Open Category) Grand Champion. This report presents our solution to PAKDD'06 Data Mining Competition. Follo...
Yang Yu, De-Chuan Zhan, Xu-Ying Liu, Ming Li, Zhi-...
ML
2000
ACM
150views Machine Learning» more  ML 2000»
15 years 6 months ago
Adaptive Retrieval Agents: Internalizing Local Context and Scaling up to the Web
This paper discusses a novel distributed adaptive algorithm and representation used to construct populations of adaptive Web agents. These InfoSpiders browse networked information ...
Filippo Menczer, Richard K. Belew
ML
2002
ACM
178views Machine Learning» more  ML 2002»
15 years 6 months ago
Metric-Based Methods for Adaptive Model Selection and Regularization
We present a general approach to model selection and regularization that exploits unlabeled data to adaptively control hypothesis complexity in supervised learning tasks. The idea ...
Dale Schuurmans, Finnegan Southey