Sciweavers

» Value chain modelling using system dynamics
AI
1998
Springer
Model-Based Average Reward Reinforcement Learning
Reinforcement Learning (RL) is the study of programs that improve their performance by receiving rewards and punishments from the environment. Most RL methods optimize the discoun...
Prasad Tadepalli, DoKyeong Ok
WSC
2008
Applicability of hybrid simulation to different modes of governance in UK healthcare
Healthcare organizations exhibit both detailed and dynamic complexity. Effective and sustainable decision-making in healthcare requires tools that can comprehend this complexity. D...
Kirandeep Chahal, Tillal Eldabi
ICASSP
2011
IEEE
Dynamics of tongue gestures extracted automatically from ultrasound
We describe a system for automatically extracting dynamics of tongue gestures from ultrasound images of the tongue using translational deep belief networks (tDBNs). In tDBNs, a jo...
Jeff Berry, Ian Fasel
CDC
2010
IEEE
Adaptive bases for Q-learning
We consider reinforcement learning, and in particular, the Q-learning algorithm in large state and action spaces. In order to cope with the size of the spaces, a functio...
Dotan Di Castro, Shie Mannor
FCCM
2004
IEEE
Efficient Execution of Process Networks on a Reconfigurable Hardware Virtual Machine
In this paper we present a novel use of an FPGA as a computing element for streaming-based applications. We investigate the virtualized execution of dynamically reconfigurable tasks. We...
Matthias Dyer, Marco Platzner, Lothar Thiele