Search Sciweavers | Sciweavers

2750 search results - page 75 / 550

» The complexity of learning SUBSEQ(A)

KCAP
2009
ACM

171 views, Information Technology

Interactively shaping agents via human reinforcement: the TAMER framework

16 years 20 days ago

Download userweb.cs.utexas.edu

As computational learning agents move into domains that incur real costs (e.g., autonomous driving or financial investment), it will be necessary to learn good policies without n...

W. Bradley Knox, Peter Stone

ICDM
2005
IEEE

116 views, Data Mining

Learning Functional Dependency Networks Based on Genetic Programming

15 years 11 months ago

Download alumni.cuhk.edu.hk

Bayesian Network (BN) is a powerful network model, which represents a set of variables in the domain and provides the probabilistic relationships among them. But BN can handle dis...

Wing-Ho Shum, Kwong-Sak Leung, Man Leung Wong

NN
2006
Springer

127 views, Neural Networks

The asymptotic equipartition property in reinforcement learning and its relation to return maximization

15 years 6 months ago

Download www.ece.uvic.ca

We discuss an important property called the asymptotic equipartition property on empirical sequences in reinforcement learning. This states that the typical set of empirical seque...

Kazunori Iwata, Kazushi Ikeda, Hideaki Sakai

ICML
2007
IEEE

145 views, Machine Learning

Sample compression bounds for decision trees

16 years 7 months ago

Download www.machinelearning.org

We propose a formulation of the Decision Tree learning algorithm in the Compression settings and derive tight generalization error bounds. In particular, we propose Sample Compres...

Mohak Shah

ALT
2010
Springer

122 views, Machine Learning

Distribution-Dependent PAC-Bayes Priors

15 years 7 months ago

Download www.cs.ucl.ac.uk

We further develop the idea that the PAC-Bayes prior can be informed by the data-generating distribution. We prove sharp bounds for an existing framework of Gibbs algorithms, and ...

Guy Lever, François Laviolette, John Shawe-...
