Sciweavers

8499 search results - page 335 / 1700
» cans 2009
Sort
View
ICML
2009
IEEE
16 years 7 months ago
Proto-predictive representation of states with simple recurrent temporal-difference networks
We propose a new neural network architecture, called Simple Recurrent Temporal-Difference Networks (SR-TDNs), that learns to predict future observations in partially observable en...
Takaki Makino
ICML
2009
IEEE
16 years 7 months ago
Partial order embedding with multiple kernels
We consider the problem of embedding arbitrary objects (e.g., images, audio, documents) into Euclidean space subject to a partial order over pairwise distances. Partial order cons...
Brian McFee, Gert R. G. Lanckriet
ICML
2009
IEEE
16 years 7 months ago
Bayesian inference for Plackett-Luce ranking models
This paper gives an efficient Bayesian method for inferring the parameters of a PlackettLuce ranking model. Such models are parameterised distributions over rankings of a finite s...
John Guiver, Edward Snelson
ICML
2009
IEEE
16 years 7 months ago
Model-free reinforcement learning as mixture learning
We cast model-free reinforcement learning as the problem of maximizing the likelihood of a probabilistic mixture model via sampling, addressing both the infinite and finite horizo...
Nikos Vlassis, Marc Toussaint
ICML
2009
IEEE
16 years 7 months ago
Hilbert space embeddings of conditional distributions with applications to dynamical systems
In this paper, we extend the Hilbert space embedding approach to handle conditional distributions. We derive a kernel estimate for the conditional embedding, and show its connecti...
Le Song, Jonathan Huang, Alexander J. Smola, Kenji...