Sciweavers

» From States to Histories
NIPS
2001
Model-Free Least-Squares Policy Iteration
We propose a new approach to reinforcement learning which combines least squares function approximation with policy iteration. Our method is model-free and completely off policy. ...
Michail G. Lagoudakis, Ronald Parr
NIPS
2001
Sequential Noise Compensation by Sequential Monte Carlo Method
We present a sequential Monte Carlo method applied to additive noise compensation for robust speech recognition in time-varying noise. The method generates a set of samples accord...
K. Yao, S. Nakamura
IADIS
2003
A Mobile Agent Based Registration System
A mobile agent is a software agent that has the ability to transfer its program code, data and execution state across a network to a remote computer for execution. In this paper, ...
K. K. Wong, C. K. Heng, P. C. Leong, Ma-Tit Yap
ICMLA
2003
A Distributed Reinforcement Learning Approach to Pattern Inference in Go
This paper shows that the distributed representation found in Learning Vector Quantization (LVQ) enables reinforcement learning methods to cope with a large decision search spa...
Myriam Abramson, Harry Wechsler
PSB
2004
Modeling Cellular Processes with Variational Bayesian Cooperative Vector Quantizer
Gene expression of a cell is controlled by sophisticated cellular processes. The capability of inferring the states of these cellular processes would provide insight into the mech...
Xinghua Lu, Milos Hauskrecht, Roger S. Day