Search Sciweavers | Sciweavers

2466 search results - page 380 / 494

» Algorithms for distributed functional monitoring

CDC
2009
IEEE

132 views · Control Systems

Q-learning and Pontryagin's Minimum Principle

15 years 11 months ago

Download www.stanford.edu

Abstract— Q-learning is a technique used to compute an optimal policy for a controlled Markov chain based on observations of the system controlled using a non-optimal policy. It ...

Prashant G. Mehta, Sean P. Meyn

CVPR
1997
IEEE

163 views · Computer Vision

Smoothness in Layers: Motion segmentation using nonparametric mixture estimation

15 years 10 months ago

Download www.cs.huji.ac.il

Grouping based on common motion, or “common fate,” provides a powerful cue for segmenting image sequences. Recently a number of algorithms have been developed that successfully...

Yair Weiss

ICCAD
1994
IEEE

61 views · Hardware

Simultaneous driver and wire sizing for performance and power optimization

15 years 10 months ago

Download cadlab.cs.ucla.edu

In this paper, we study the simultaneous driver and wire sizing (SDWS) problem under two objective functions: (i) delay minimization only, or (ii) combined delay and power dissipat...

Jason Cong, Cheng-Kok Koh

EDM
2010

160 views · Data Mining

Using Neural Imaging and Cognitive Modeling to Infer Mental States while Using an Intelligent Tutoring System

15 years 7 months ago

Download educationaldatamining.org

Functional magnetic resonance imaging (fMRI) data were collected while students worked with a tutoring system that taught an algebra isomorph. A cognitive model predicted the distr...

Jon M. Fincham, John R. Anderson, Shawn Betts, Jen...

NIPS
2001

206 views · Information Technology

Model-Free Least-Squares Policy Iteration

15 years 7 months ago

Download www.cs.duke.edu

We propose a new approach to reinforcement learning which combines least squares function approximation with policy iteration. Our method is model-free and completely off policy. ...

Michail G. Lagoudakis, Ronald Parr
