Search Sciweavers | Sciweavers

3643 search results - page 238 / 729

» Learning Submodular Functions

ICML
2002
IEEE

146 views, Machine Learning

Hierarchically Optimal Average Reward Reinforcement Learning

16 years 7 months ago

Download www.cs.ualberta.ca

Two notions of optimality have been explored in previous work on hierarchical reinforcement learning (HRL): hierarchical optimality, or the optimal policy in the space defined by ...

Mohammad Ghavamzadeh, Sridhar Mahadevan

COLT
2006
Springer

85 views, Machine Learning

A Randomized Online Learning Algorithm for Better Variance Control

15 years 10 months ago

Download certis.enpc.fr

We propose a sequential randomized algorithm, which at each step concentrates on functions having both low risk and low variance with respect to the previous step prediction functi...

Jean-Yves Audibert

ECAI
2006
Springer

245 views, Artificial Intelligence

Least Squares SVM for Least Squares TD Learning

15 years 10 months ago

Download homepages.feis.herts.ac.uk

Abstract. We formulate the problem of least squares temporal difference learning (LSTD) in the framework of least squares SVM (LS-SVM). To cope with the large amount (and possible ...

Tobias Jung, Daniel Polani

ATAL
2008
Springer

145 views, Intelligent Agents

Artificial agents learning human fairness

15 years 8 months ago

Download www.sce.carleton.ca

Recent advances in technology allow multi-agent systems to be deployed in cooperation with or as a service for humans. Typically, those systems are designed assuming individually ...

Steven de Jong, Karl Tuyls, Katja Verbeeck

CORR
2010
Springer

130 views, Education

Approximated Structured Prediction for Learning Large Scale Graphical Models

15 years 6 months ago

Download ttic.uchicago.edu

In this paper we propose an approximated structured prediction framework for large scale graphical models and derive message-passing algorithms for learning their parameters effic...

Tamir Hazan, Raquel Urtasun
