Search Sciweavers | Sciweavers

3502 search results - page 355 / 701

» From Machine Learning to Machine Reasoning

166

Voted

ICML
2004
IEEE

203views Machine Learning» more ICML 2004»

Training conditional random fields via gradient tree boosting

16 years 7 months ago

Download web.engr.oregonstate.edu

Conditional Random Fields (CRFs; Lafferty, McCallum, & Pereira, 2001) provide a flexible and powerful model for learning to assign labels to elements of sequences in such appl...

Thomas G. Dietterich, Adam Ashenfelter, Yaroslav B...

claim paper

Read More »

181

click to vote

ICML
2009
IEEE

150views Machine Learning» more ICML 2009»

Boosting with structural sparsity

16 years 7 months ago

Download www.cs.berkeley.edu

Despite popular belief, boosting algorithms and related coordinate descent methods are prone to overfitting. We derive modifications to AdaBoost and related gradient-based coordin...

John Duchi, Yoram Singer

claim paper

Read More »

191

click to vote

ICML
2009
IEEE

143views Machine Learning» more ICML 2009»

Uncertainty sampling and transductive experimental design for active dual supervision

16 years 7 months ago

Download people.cs.uchicago.edu

Dual supervision refers to the general setting of learning from both labeled examples as well as labeled features. Labeled features are naturally available in tasks such as text c...

Vikas Sindhwani, Prem Melville, Richard D. Lawrenc...

claim paper

Read More »

158

click to vote

ICML
2008
IEEE

144views Machine Learning» more ICML 2008»

An HDP-HMM for systems with state persistence

16 years 7 months ago

Download www.cs.berkeley.edu

The hierarchical Dirichlet process hidden Markov model (HDP-HMM) is a flexible, nonparametric model which allows state spaces of unknown size to be learned from data. We demonstra...

Emily B. Fox, Erik B. Sudderth, Michael I. Jordan,...

claim paper

Read More »

186

click to vote

ECML
2007
Springer

192views Machine Learning» more ECML 2007»

Policy Gradient Critics

16 years 28 days ago

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber

claim paper

Read More »

« Prev « First page 355 / 701 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers