Search Sciweavers | Sciweavers

180 search results - page 9 / 36

» On the Convergence Rate of Good-Turing Estimators

CORR
2011
Springer

Fast global convergence of gradient methods for high-dimensional statistical recovery

15 years 1 month ago

Download www.cs.berkeley.edu

Many statistical M-estimators are based on convex optimization problems formed by the weighted sum of a loss function with a norm-based regularizer. We analyze the convergence rat...

Alekh Agarwal, Sahand Negahban, Martin J. Wainwrig...

ISBI
2004
IEEE

A Fast Fully 4D Incremental Gradient Reconstruction Algorithm for List Mode PET Data

16 years 6 months ago

Download neuroimage.usc.edu

We present a fully four-dimensional, globally convergent, incremental gradient algorithm to estimate the continuous-time tracer density from list mode positron emission tomography...

Quanzheng Li, Evren Asma, Richard M. Leahy

TNN
2010

On the weight convergence of Elman networks

15 years 21 days ago

Download www3.ntu.edu.sg

An Elman network (EN) can be viewed as a feedforward (FF) neural network with an additional set of inputs from the context layer (feedback from the hidden layer). Therefo...

Qing Song

COLT
2000
Springer

Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

15 years 10 months ago

Download www.cs.iastate.edu

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process (POMDP), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter

ICML
2010
IEEE

Temporal Difference Bayesian Model Averaging: A Bayesian Perspective on Adapting Lambda

15 years 4 months ago

Download www.icml2010.org

Temporal difference (TD) algorithms are attractive for reinforcement learning due to their ease-of-implementation and use of "bootstrapped" return estimates to make effi...

Carlton Downey, Scott Sanner
