This paper presents a direct reinforcement learning algorithm, called Finite-Element Reinforcement Learning, in the continuous case, i.e. continuous state-space and time. The eval...
Eligibility traces have been shown to speed reinforcement learning, to make it more robust to hidden states, and to provide a link between Monte Carlo and temporal-difference meth...
Doina Precup, Richard S. Sutton, Satinder P. Singh
An iterative solution to the problem of image deconvolution is presented. The previous image estimate is pre-filtered using a stabilizing function that is updated based on current...
Standard pattern recognition provides effective and noise-tolerant tools for machine learning tasks; however, most approaches only deal with real vectors of a finite and fixed dime...
In this paper, the critical clearing time, tcc in power system transient stability analysis is modeled as a random variable due to the randomness nature of power system load. A lin...