Search Sciweavers | Sciweavers

52 search results - page 8 / 11

» Approximate Convex Optimization by Online Game Playing

148

click to vote

AAMAS
2010
Springer

158views Intelligent Agents» more AAMAS 2010»

Coordinated learning in multiagent MDPs with infinite state-space

15 years 6 months ago

Download gaips.inesc-id.pt

Abstract In this paper we address the problem of simultaneous learning and coordination in multiagent Markov decision problems (MMDPs) with infinite state-spaces. We separate this ...

Francisco S. Melo, M. Isabel Ribeiro

claim paper

Read More »

140

click to vote

ATAL
2007
Springer

112views Intelligent Agents» more ATAL 2007»

A globally optimal algorithm for TTD-MDPs

16 years 5 days ago

Download www.cc.gatech.edu

In this paper, we discuss the use of Targeted Trajectory Distribution Markov Decision Processes (TTD-MDPs)—a variant of MDPs in which the goal is to realize a speciﬁed distrib...

Sooraj Bhat, David L. Roberts, Mark J. Nelson, Cha...

claim paper

Read More »

149

click to vote

AAIM
2008
Springer

94views Algorithms» more AAIM 2008»

Speed Scaling with a Solar Cell

16 years 10 days ago

Download www.cs.pitt.edu

We consider the setting of a device that obtains it energy from a battery and some regenerative source such as a solar cell. We consider the speed scaling problem of scheduling a c...

Nikhil Bansal, Ho-Leung Chan, Kirk Pruhs

claim paper

Read More »

184

click to vote

SI3D
2005
ACM

146views Computer Graphics» more SI3D 2005»

User interfaces for interactive control of physics-based 3D characters

15 years 11 months ago

Download www.cs.ubc.ca

We present two user interfaces for the interactive control of dynamically-simulated characters. The ﬁrst interface uses an ‘action palette’ and targets sports prototyping ap...

Peng Zhao, Michiel van de Panne

claim paper

Read More »

198

click to vote

ATAL
2005
Springer

181views Intelligent Agents» more ATAL 2005»

Improving reinforcement learning function approximators via neuroevolution

15 years 11 months ago

Download www.aaai.org

Reinforcement learning problems are commonly tackled with temporal difference methods, which use dynamic programming and statistical sampling to estimate the long-term value of ta...

Shimon Whiteson

claim paper

Read More »

« Prev « First page 8 / 11 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers