Sciweavers

1406 search results - page 72 / 282
» Learning Pseudo-independent Models: Analytical and Experimen...
Sort
View
ICML
2006
IEEE
16 years 7 months ago
An intrinsic reward mechanism for efficient exploration
How should a reinforcement learning agent act if its sole purpose is to efficiently learn an optimal policy for later use? In other words, how should it explore, to be able to exp...
Özgür Simsek, Andrew G. Barto
NN
2006
Springer
15 years 6 months ago
The misbehavior of value and the discipline of the will
Most reinforcement learning models of animal conditioning operate under the convenient, though fictive, assumption that Pavlovian conditioning concerns prediction learning whereas...
Peter Dayan, Yael Niv, Ben Seymour, Nathaniel D. D...
COLING
2010
15 years 1 months ago
Machine Transliteration: Leveraging on Third Languages
This paper presents two pivot strategies for statistical machine transliteration, namely system-based pivot strategy and model-based pivot strategy. Given two independent source-p...
Min Zhang, Xiangyu Duan, Vladimir Pervouchine, Hai...
AIRS
2010
Springer
15 years 4 months ago
Learning to Rank with Supplementary Data
This paper is concerned with a new task of ranking, referred to as "supplementary data assisted ranking", or "supplementary ranking" for short. Different from c...
Wenkui Ding, Tao Qin, Xu-Dong Zhang
ICML
2009
IEEE
16 years 7 months ago
A stochastic memoizer for sequence data
We propose an unbounded-depth, hierarchical, Bayesian nonparametric model for discrete sequence data. This model can be estimated from a single training sequence, yet shares stati...
Frank Wood, Cédric Archambeau, Jan Gasthaus...