Search Sciweavers | Sciweavers

» Statistical Performance Modeling and Optimization

ATAL
2007
Springer

Intelligent Agents

Combinatorial resource scheduling for multiagent MDPs

16 years 1 months ago

Download ai.stanford.edu

Optimal resource scheduling in multiagent systems is a computationally challenging task, particularly when the values of resources are not additive. We consider the combinatorial ...

Dmitri A. Dolgov, Michael R. James, Michael E. Sam...

ATAL
2007
Springer

Intelligent Agents

Q-value functions for decentralized POMDPs

16 years 1 months ago

Download www.science.uva.nl

Planning in single-agent models like MDPs and POMDPs can be carried out by resorting to Q-value functions: a (near-) optimal Q-value function is computed in a recursive manner by ...

Frans A. Oliehoek, Nikos A. Vlassis

ATAL
2007
Springer

Intelligent Agents

An advanced bidding agent for advertisement selection on public displays

16 years 1 months ago

Download eprints.ecs.soton.ac.uk

In this paper we present an advanced bidding agent that participates in first-price sealed bid auctions to allocate advertising space on BluScreen – an experimental public adve...

Alex Rogers, Esther David, Terry R. Payne, Nichola...

IPSN
2007
Springer

Sensor Networks

Power scheduling for wireless sensor and actuator networks

16 years 1 months ago

Download www.ece.rice.edu

We previously presented a model for some wireless sensor and actuator network (WSAN) applications based on the vector space tools of frame theory. In this WSAN model there is a we...

Christopher J. Rozell, Don H. Johnson

CDC
2009
IEEE

Control Systems

Q-learning and Pontryagin's Minimum Principle

15 years 11 months ago

Download www.stanford.edu

Q-learning is a technique used to compute an optimal policy for a controlled Markov chain based on observations of the system controlled using a non-optimal policy. It ...

Prashant G. Mehta, Sean P. Meyn
