Sciweavers

ICML
1996
IEEE
15 years 10 months ago
A Convergent Reinforcement Learning Algorithm in the Continuous Case: The Finite-Element Reinforcement Learning
This paper presents a direct reinforcement learning algorithm, called Finite-Element Reinforcement Learning, in the continuous case, i.e. continuous state-space and time. The eval...
Rémi Munos
ICML
2000
IEEE
16 years 7 months ago
Eligibility Traces for Off-Policy Policy Evaluation
Eligibility traces have been shown to speed reinforcement learning, to make it more robust to hidden states, and to provide a link between Monte Carlo and temporal-difference meth...
Doina Precup, Richard S. Sutton, Satinder P. Singh
ICIP
2000
IEEE
16 years 8 months ago
The Iterative Deconvolution of Linearly Blurred Images Using Non-Parametric Stabilizing Functions
An iterative solution to the problem of image deconvolution is presented. The previous image estimate is pre-filtered using a stabilizing function that is updated based on current...
James R. Hare, James P. Reilly
ESANN
2004
15 years 7 months ago
Neural methods for non-standard data
Standard pattern recognition provides effective and noise-tolerant tools for machine learning tasks; however, most approaches only deal with real vectors of a finite and fixed dime...
Barbara Hammer, Brijnesh J. Jain
HICSS
2002
IEEE
15 years 11 months ago
Calculation of the Probability Density Function of Critical Clearing Time in Transient Stability Analysis
In this paper, the critical clearing time, tcc, in power system transient stability analysis is modeled as a random variable due to the random nature of power system load. A lin...
Yiqiao Liang, Saffet Ayasun, Chika Nwankpa