Search Sciweavers | Sciweavers

3668 search results - page 454 / 734

» Margin Distribution and Learning

NIPS
2001

Information Technology

Model-Free Least-Squares Policy Iteration

15 years 8 months ago

Download www.cs.duke.edu

We propose a new approach to reinforcement learning which combines least-squares function approximation with policy iteration. Our method is model-free and completely off-policy. ...

Michail G. Lagoudakis, Ronald Parr

JNCA
2006

An evolutionary approach to prototyping pedagogical agents: from simulation to integrated system

15 years 6 months ago

Download hal.archives-ouvertes.fr

We have developed and integrated software agents with two educational groupware systems (TeamWave Workplace and FLE), using evolutionary prototyping and empirical-based design as d...

Anders I. Mørch, Jan A. Dolonen, Jan Eirik ...

JDWM
2007

Predicting Future Customers via Ensembling Gradually Expanded Trees

15 years 6 months ago

Download cs.nju.edu.cn

Our LAMDAer team won the Grand Champion title in the PAKDD'06 Data Mining Competition (Open Category). This report presents our solution to the competition. Follo...

Yang Yu, De-Chuan Zhan, Xu-Ying Liu, Ming Li, Zhi-...

ML
2000
ACM

Machine Learning

Adaptive Retrieval Agents: Internalizing Local Context and Scaling up to the Web

15 years 6 months ago

Download informatics.indiana.edu

This paper discusses a novel distributed adaptive algorithm and representation used to construct populations of adaptive Web agents. These InfoSpiders browse networked information ...

Filippo Menczer, Richard K. Belew

ML
2002
ACM

Machine Learning

Metric-Based Methods for Adaptive Model Selection and Regularization

15 years 6 months ago

Download www.cs.cmu.edu

We present a general approach to model selection and regularization that exploits unlabeled data to adaptively control hypothesis complexity in supervised learning tasks. The idea ...

Dale Schuurmans, Finnegan Southey
