Search Sciweavers | Sciweavers

1512 search results - page 221 / 303

» Qualitative reinforcement learning

200

click to vote

ACSE
2000
ACM

271views Theoretical Computer Science» more ACSE 2000»

The information environments program - a new design based IT degree

15 years 10 months ago

Download www.itee.uq.edu.au

The University of Queensland has recently established a new design-focused, studio-based IT degree at a new “flexible-learning” campus. The Bachelor of Information Environment...

Michael Docherty, Peter Sutton, Margot Brereton, S...

claim paper

Read More »

168

click to vote

ICCS
1993
Springer

99views Applied Computing» more ICCS 1993»

Towards Domain-Independent Machine Intelligence

15 years 10 months ago

Download www.soe.ucsc.edu

Adaptive predictive search (APS), is a learning system framework, which given little initial domain knowledge, increases its decision-making abilities in complex problems domains....

Robert Levinson

claim paper

Read More »

183

click to vote

NIPS
2008

110views Information Technology» more NIPS 2008»

Signal-to-Noise Ratio Analysis of Policy Gradient Algorithms

15 years 7 months ago

Download groups.csail.mit.edu

Policy gradient (PG) reinforcement learning algorithms have strong (local) convergence guarantees, but their learning performance is typically limited by a large variance in the e...

John W. Roberts, Russ Tedrake

claim paper

Read More »

167

click to vote

ICML
2005
IEEE

121views Machine Learning» more ICML 2005»

Combining model-based and instance-based learning for first order regression

16 years 7 months ago

Download www.cs.kuleuven.ac.be

T ORDER REGRESSION (EXTENDED ABSTRACT) Kurt Driessensa Saso Dzeroskib a Department of Computer Science, University of Waikato, Hamilton, New Zealand (kurtd@waikato.ac.nz) b Departm...

Kurt Driessens, Saso Dzeroski

claim paper

Read More »

176

click to vote

DSMML
2004
Springer

170views Machine Learning» more DSMML 2004»

Can Gaussian Process Regression Be Made Robust Against Model Mismatch?

15 years 11 months ago

Download eprints.pascal-network.org

Learning curves for Gaussian process (GP) regression can be strongly aﬀected by a mismatch between the ‘student’ model and the ‘teacher’ (true data generation process), e...

Peter Sollich

claim paper

Read More »

« Prev « First page 221 / 303 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers