Search Sciweavers | Sciweavers

7065 search results - page 1004 / 1413

» Iterative Scheduling Algorithms

173

click to vote

ICML
2007
IEEE

104views Machine Learning» more ICML 2007»

On one method of non-diagonal regularization in sparse Bayesian learning

16 years 7 months ago

Download www.machinelearning.org

In the paper we propose a new type of regularization procedure for training sparse Bayesian methods for classification. Transforming Hessian matrix of log-likelihood function to d...

Dmitry Kropotov, Dmitry Vetrov

claim paper

Read More »

204

click to vote

ICML
2006
IEEE

143views Machine Learning» more ICML 2006»

Fast direct policy evaluation using multiscale analysis of Markov diffusion processes

16 years 7 months ago

Download www.cs.umass.edu

Policy evaluation is a critical step in the approximate solution of large Markov decision processes (MDPs), typically requiring O(|S|3 ) to directly solve the Bellman system of |S...

Mauro Maggioni, Sridhar Mahadevan

claim paper

Read More »

195

click to vote

ICML
2001
IEEE

266views Machine Learning» more ICML 2001»

Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data

16 years 7 months ago

Download www.cis.upenn.edu

We present conditional random fields, a framework for building probabilistic models to segment and label sequence data. Conditional random fields offer several advantages over hid...

John D. Lafferty, Andrew McCallum, Fernando C. N. ...

claim paper

Read More »

179

click to vote

ICML
2001
IEEE

145views Machine Learning» more ICML 2001»

Symmetry in Markov Decision Processes and its Implications for Single Agent and Multiagent Learning

16 years 7 months ago

Download www-2.cs.cmu.edu

This paper examines the notion of symmetry in Markov decision processes (MDPs). We define symmetry for an MDP and show how it can be exploited for more effective learning in singl...

Martin Zinkevich, Tucker R. Balch

claim paper

Read More »

190

click to vote

ICML
2000
IEEE

192views Machine Learning» more ICML 2000»

Convergence Problems of General-Sum Multiagent Reinforcement Learning

16 years 7 months ago

Download www.cs.ualberta.ca

Stochastic games are a generalization of MDPs to multiple agents, and can be used as a framework for investigating multiagent learning. Hu and Wellman (1998) recently proposed a m...

Michael H. Bowling

claim paper

Read More »

« Prev « First page 1004 / 1413 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers