Sciweavers

3643 search results - page 218 / 729
» Learning Submodular Functions
Sort
View
ICML
2007
IEEE
16 years 7 months ago
A transductive framework of distance metric learning by spectral dimensionality reduction
Distance metric learning and nonlinear dimensionality reduction are two interesting and active topics in recent years. However, the connection between them is not thoroughly studi...
Fuxin Li, Jian Yang, Jue Wang
ICML
2008
IEEE
16 years 7 months ago
Learning all optimal policies with multiple criteria
We describe an algorithm for learning in the presence of multiple criteria. Our technique generalizes previous approaches in that it can learn optimal policies for all linear pref...
Leon Barrett, Srini Narayanan
ICML
2006
IEEE
16 years 7 months ago
Relational temporal difference learning
We introduce relational temporal difference learning as an effective approach to solving multi-agent Markov decision problems with large state spaces. Our algorithm uses temporal ...
Nima Asgharbeygi, David J. Stracuzzi, Pat Langley
ICML
2005
IEEE
16 years 7 months ago
Reinforcement learning with Gaussian processes
Gaussian Process Temporal Difference (GPTD) learning offers a Bayesian solution to the policy evaluation problem of reinforcement learning. In this paper we extend the GPTD framew...
Yaakov Engel, Shie Mannor, Ron Meir
CEC
2009
IEEE
16 years 1 months ago
Structure learning and optimisation in a Markov-network based estimation of distribution algorithm
—Structure learning is a crucial component of a multivariate Estimation of Distribution algorithm. It is the part which determines the interactions between variables in the proba...
Alexander E. I. Brownlee, John A. W. McCall, Siddh...