Sciweavers

9841 search results - page 327 / 1969
» Distributed Value Functions
Sort
View
JASIS
2007
83views more  JASIS 2007»
15 years 6 months ago
Dynamic h-index: The Hirsch index in function of time
When we have a group of papers and when we fix the present time we can determine the unique number h being the number of papers that received h or more citations while the other p...
Leo Egghe
EDBT
2009
ACM
104views Database» more  EDBT 2009»
16 years 1 months ago
A query processor for prediction-based monitoring of data streams
Networks of sensors are used in many different fields, from industrial applications to surveillance applications. A common feature of these applications is the necessity of a mo...
Sergio Ilarri, Ouri Wolfson, Eduardo Mena, Arantza...
ICML
1996
IEEE
15 years 10 months ago
A Convergent Reinforcement Learning Algorithm in the Continuous Case: The Finite-Element Reinforcement Learning
This paper presents a direct reinforcement learning algorithm, called Finite-Element Reinforcement Learning, in the continuous case, i.e. continuous state-space and time. The eval...
Rémi Munos
NIPS
2001
15 years 8 months ago
Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning
Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...
Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...
APVIS
2010
15 years 7 months ago
Volume visualization based on statistical transfer-function spaces
It is a difficult task to design transfer functions for noisy data. In traditional transfer-function spaces, data values of different materials overlap. In this paper we introduce...
Martin Haidacher, Daniel Patel, Stefan Bruckner, A...