Search Sciweavers | Sciweavers

3643 search results - page 405 / 729

» Learning Submodular Functions

162

click to vote

EWRL
2008

129views Machine Learning» more EWRL 2008»

Markov Decision Processes with Arbitrary Reward Processes

15 years 8 months ago

Download www.cim.mcgill.ca

Abstract. We consider a control problem where the decision maker interacts with a standard Markov decision process with the exception that the reward functions vary arbitrarily ove...

Jia Yuan Yu, Shie Mannor, Nahum Shimkin

claim paper

Read More »

192

click to vote

ICONIP
2007

141views Information Technology» more ICONIP 2007»

Natural Conjugate Gradient in Variational Inference

15 years 8 months ago

Download eprints.pascal-network.org

Variational methods for approximate inference in machine learning often adapt a parametric probability distribution to optimize a given objective function. This view is especially ...

Antti Honkela, Matti Tornio, Tapani Raiko, Juha Ka...

claim paper

Read More »

117

click to vote

ICML
2010
IEEE

167views Machine Learning» more ICML 2010»

Internal Rewards Mitigate Agent Boundedness

15 years 7 months ago

Download www-personal.umich.edu

Abstract--Reinforcement learning (RL) research typically develops algorithms for helping an RL agent best achieve its goals-however they came to be defined--while ignoring the rela...

Jonathan Sorg, Satinder P. Singh, Richard Lewis

claim paper

Read More »

180

click to vote

ICML
2010
IEEE

217views Machine Learning» more ICML 2010»

The Elastic Embedding Algorithm for Dimensionality Reduction

15 years 7 months ago

Download faculty.ucmerced.edu

We propose a new dimensionality reduction method, the elastic embedding (EE), that optimises an intuitive, nonlinear objective function of the low-dimensional coordinates of the d...

Miguel Á. Carreira-Perpiñán

claim paper

Read More »

182

click to vote

ICML
2010
IEEE

227views Machine Learning» more ICML 2010»

Label Ranking Methods based on the Plackett-Luce Model

15 years 7 months ago

Download www.uni-marburg.de

This paper introduces two new methods for label ranking based on a probabilistic model of ranking data, called the Plackett-Luce model. The idea of the first method is to use the ...

Weiwei Cheng, Krzysztof Dembczynski, Eyke Hül...

claim paper

Read More »

« Prev « First page 405 / 729 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers