Sciweavers

2466 search results - page 380 / 494
» Algorithms for distributed functional monitoring
Sort
View
CDC
2009
IEEE
132views Control Systems» more  CDC 2009»
15 years 11 months ago
Q-learning and Pontryagin's Minimum Principle
Abstract— Q-learning is a technique used to compute an optimal policy for a controlled Markov chain based on observations of the system controlled using a non-optimal policy. It ...
Prashant G. Mehta, Sean P. Meyn
CVPR
1997
IEEE
15 years 10 months ago
Smoothness in Layers: Motion segmentation using nonparametric mixture estimation
Grouping based on common motion, or “common fate” provides a powerful cue for segmenting image sequences. Recently a number of algorithms have been developed that successfully...
Yair Weiss
ICCAD
1994
IEEE
61views Hardware» more  ICCAD 1994»
15 years 10 months ago
Simultaneous driver and wire sizing for performance and power optimization
In this paper, we study the simultaneousdriver and wire sizing (SDWS) problem under two objective functions: (i) delay minimization only, or (ii) combined delay and power dissipat...
Jason Cong, Cheng-Kok Koh
EDM
2010
160views Data Mining» more  EDM 2010»
15 years 7 months ago
Using Neural Imaging and Cognitive Modeling to Infer Mental States while Using an Intelligent Tutoring System
Functional magnetic resonance imaging (fMRI) data were collected while students worked with a tutoring system that taught an algebra isomorph. A cognitive model predicted the distr...
Jon M. Fincham, John R. Anderson, Shawn Betts, Jen...
NIPS
2001
15 years 7 months ago
Model-Free Least-Squares Policy Iteration
We propose a new approach to reinforcement learning which combines least squares function approximation with policy iteration. Our method is model-free and completely off policy. ...
Michail G. Lagoudakis, Ronald Parr