Search Sciweavers | Sciweavers

158

ICML
2007
IEEE

119views Machine Learning» more ICML 2007»

Scalable training of L1-regularized log-linear models

16 years 7 months ago

The l-bfgs limited-memory quasi-Newton method is the algorithm of choice for optimizing the parameters of large-scale log-linear models with L2 regularization, but it cannot be us...

Galen Andrew, Jianfeng Gao

claim paper

Read More »

198

click to vote

ICML
2007
IEEE

149views Machine Learning» more ICML 2007»

Maximum margin clustering made practical

16 years 7 months ago

Download www.machinelearning.org

Maximum margin clustering (MMC) is a recent large margin unsupervised learning approach that has often outperformed conventional clustering methods. Computationally, it involves n...

Kai Zhang, Ivor W. Tsang, James T. Kwok

claim paper

Read More »

202

click to vote

ICML
2007
IEEE

200views Machine Learning» more ICML 2007»

Multi-task reinforcement learning: a hierarchical Bayesian approach

16 years 7 months ago

Download www.machinelearning.org

We consider the problem of multi-task reinforcement learning, where the agent needs to solve a sequence of Markov Decision Processes (MDPs) chosen randomly from a fixed but unknow...

Aaron Wilson, Alan Fern, Soumya Ray, Prasad Tadepa...

claim paper

Read More »

181

Voted

ICML
2007
IEEE

121views Machine Learning» more ICML 2007»

Manifold-adaptive dimension estimation

16 years 7 months ago

Download www.machinelearning.org

Intuitively, learning should be easier when the data points lie on a low-dimensional submanifold of the input space. Recently there has been a growing interest in algorithms that ...

Amir Massoud Farahmand, Csaba Szepesvári, J...

claim paper

Read More »

201

click to vote

ICML
2007
IEEE

141views Machine Learning» more ICML 2007»

Reinforcement learning by reward-weighted regression for operational space control

16 years 7 months ago

Download www.machinelearning.org

Many robot control problems of practical importance, including operational space control, can be reformulated as immediate reward reinforcement learning problems. However, few of ...

Jan Peters, Stefan Schaal

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers