Search Sciweavers | Sciweavers

1406 search results - page 72 / 282

» Learning Pseudo-independent Models: Analytical and Experimen...

ICML
2006
IEEE

142 views, Machine Learning

An intrinsic reward mechanism for efficient exploration

16 years 7 months ago

Download www-anw.cs.umass.edu

How should a reinforcement learning agent act if its sole purpose is to efficiently learn an optimal policy for later use? In other words, how should it explore, to be able to exp...

Özgür Simsek, Andrew G. Barto

NN
2006
Springer

79 views, Neural Networks

The misbehavior of value and the discipline of the will

15 years 6 months ago

Download www.cns.nyu.edu

Most reinforcement learning models of animal conditioning operate under the convenient, though fictive, assumption that Pavlovian conditioning concerns prediction learning whereas...

Peter Dayan, Yael Niv, Ben Seymour, Nathaniel D. D...

COLING
2010

110 views, Computational Linguistics

Machine Transliteration: Leveraging on Third Languages

15 years 1 month ago

Download www.aclweb.org

This paper presents two pivot strategies for statistical machine transliteration, namely system-based pivot strategy and model-based pivot strategy. Given two independent source-p...

Min Zhang, Xiangyu Duan, Vladimir Pervouchine, Hai...

AIRS
2010
Springer

225 views, Information Technology

Learning to Rank with Supplementary Data

15 years 4 months ago

Download research.microsoft.com

This paper is concerned with a new task of ranking, referred to as "supplementary data assisted ranking", or "supplementary ranking" for short. Different from c...

Wenkui Ding, Tao Qin, Xu-Dong Zhang

ICML
2009
IEEE

141 views, Machine Learning

A stochastic memoizer for sequence data

16 years 7 months ago

Download www.gatsby.ucl.ac.uk

We propose an unbounded-depth, hierarchical, Bayesian nonparametric model for discrete sequence data. This model can be estimated from a single training sequence, yet shares stati...

Frank Wood, Cédric Archambeau, Jan Gasthaus...
