Search Sciweavers | Sciweavers


» The Online Median Problem


NIPS
2000


Using Free Energies to Represent Q-values in a Multiagent Reinforcement Learning Task


Download members.chello.at

The problem of reinforcement learning in large factored Markov decision processes is explored. The Q-value of a state-action pair is approximated by the free energy of a product o...

Brian Sallans, Geoffrey E. Hinton


ESA
2010
Springer


A Robust PTAS for Machine Covering and Packing


Download www.math.tu-berlin.de

Abstract. Minimizing the makespan or maximizing the minimum machine load are two of the most important and fundamental parallel machine scheduling problems. In an online scenario, ...

Martin Skutella, José Verschae


AUTOMATICA
2010


A line search improvement of efficient MPC


Download users.ox.ac.uk

A recent lifting technique led to a computationally efficient Model Predictive Control (MPC) strategy in which the online optimization is performed using a univariate Newton-Raphs...

Basil Kouvaritakis, Shuang Li, Mark Cannon


AUTOMATICA
2008


Policy iteration based feedback control


Download www.cfins.au.tsinghua.edu.cn

It is well known that stochastic control systems can be viewed as Markov decision processes (MDPs) with continuous state spaces. In this paper, we propose to apply the policy iter...

Kan-Jian Zhang, Yan-Kai Xu, Xi Chen, Xi-Ren Cao


DEDS
2010


Optimal Admission Control of Discrete Event Systems with Real-Time Constraints


Download vita.bu.edu

Abstract. The problem of optimally controlling the processing rate of tasks in Discrete Event Systems (DES) with hard real-time constraints has been solved in [9] under the assump...

Jianfeng Mao, Christos G. Cassandras
