Search Sciweavers | Sciweavers

5771 search results - page 366 / 1155

» Learning similarity space

189

click to vote

IJCAI
2003

117views Artificial Intelligence» more IJCAI 2003»

A Learning Algorithm for Web Page Scoring Systems

15 years 8 months ago

Download dli.iiit.ac.in

Hyperlink analysis is a successful approach to define algorithms which compute the relevance of a document on the basis of the citation graph. In this paper we propose a technique...

Michelangelo Diligenti, Marco Gori, Marco Maggini

claim paper

Read More »

179

click to vote

IJCAI
2003

130views Artificial Intelligence» more IJCAI 2003»

Multiple-Goal Reinforcement Learning with Modular Sarsa(0)

15 years 8 months ago

Download www.cc.gatech.edu

We present a new algorithm, GM-Sarsa(0), for ﬁnding approximate solutions to multiple-goal reinforcement learning problems that are modeled as composite Markov decision processe...

Nathan Sprague, Dana H. Ballard

claim paper

Read More »

161

click to vote

ICASSP
2010
IEEE

155views Signal Processing» more ICASSP 2010»

Parametric dictionary learning using steepest descent

15 years 7 months ago

Download hal.archives-ouvertes.fr

In this paper, we suggest to use a steepest descent algorithm for learning a parametric dictionary in which the structure or atom functions are known in advance. The structure of ...

Mahdi Ataee, Hadi Zayyani, Massoud Babaie-Zadeh, C...

claim paper

Read More »

181

click to vote

JMLR
2006

112views more JMLR 2006»

Kernels on Prolog Proof Trees: Statistical Learning in the ILP Setting

15 years 6 months ago

Download jmlr.csail.mit.edu

We develop kernels for measuring the similarity between relational instances using background knowledge expressed in first-order logic. The method allows us to bridge the gap betw...

Andrea Passerini, Paolo Frasconi, Luc De Raedt

claim paper

Read More »

201

click to vote

ML
2002
ACM

168views Machine Learning» more ML 2002»

On Average Versus Discounted Reward Temporal-Difference Learning

15 years 6 months ago

Download web.mit.edu

We provide an analytical comparison between discounted and average reward temporal-difference (TD) learning with linearly parameterized approximations. We first consider the asympt...

John N. Tsitsiklis, Benjamin Van Roy

claim paper

Read More »

« Prev « First page 366 / 1155 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers