Search Sciweavers | Sciweavers

4255 search results - page 258 / 851

» On Learning Boolean Functions

161

click to vote

PKDD
2009
Springer

144views Data Mining» more PKDD 2009»

Compositional Models for Reinforcement Learning

16 years 1 months ago

Download userweb.cs.utexas.edu

Abstract. Innovations such as optimistic exploration, function approximation, and hierarchical decomposition have helped scale reinforcement learning to more complex environments, ...

Nicholas K. Jong, Peter Stone

claim paper

Read More »

156

click to vote

ECAI
2008
Springer

124views Artificial Intelligence» more ECAI 2008»

Exploiting locality of interactions using a policy-gradient approach in multiagent learning

15 years 8 months ago

Download gaips.inesc-id.pt

In this paper, we propose a policy gradient reinforcement learning algorithm to address transition-independent Dec-POMDPs. This approach aims at implicitly exploiting the locality...

Francisco S. Melo

claim paper

Read More »

191

click to vote

JMLR
2006

108views more JMLR 2006»

Learning Spectral Clustering, With Application To Speech Separation

15 years 6 months ago

Download www.cs.berkeley.edu

Spectral clustering refers to a class of techniques which rely on the eigenstructure of a similarity matrix to partition points into disjoint clusters, with points in the same clu...

Francis R. Bach, Michael I. Jordan

claim paper

Read More »

147

click to vote

NECO
2008

60views more NECO 2008»

Sleeping Our Way to Weight Normalization and Stable Learning

15 years 6 months ago

Download www.cogsci.ucsd.edu

The functions of sleep have been an enduring mystery. Recently, Tononi and Cirelli hypothesized that one of the functions of slow-wave sleep is to scale down synapses in the corte...

Thomas J. Sullivan, Virginia R. de Sa

claim paper

Read More »

204

click to vote

ICASSP
2011
IEEE

165views Signal Processing» more ICASSP 2011»

A sliding-window online fast variational sparse Bayesian learning algorithm

14 years 10 months ago

Download mirlab.org

In this work a new online learning algorithm that uses automatic relevance determination (ARD) is proposed for fast adaptive nonlinear ﬁltering. A sequential decision rule for i...

Thomas Buchgraber, Dmitriy Shutin, H. Vincent Poor

claim paper

Read More »

« Prev « First page 258 / 851 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers