Sciweavers

2711 search results - page 72 / 543
» Convergence of the Wake-Sleep Algorithm
Sort
View
APPROX
2006
Springer
121views Algorithms» more  APPROX 2006»
15 years 9 months ago
A Randomized Solver for Linear Systems with Exponential Convergence
Abstract. The Kaczmarz method for solving linear systems of equations Ax = b is an iterative algorithm that has found many applications ranging from computer tomography to digital ...
Thomas Strohmer, Roman Vershynin
COLT
2004
Springer
15 years 11 months ago
Reinforcement Learning for Average Reward Zero-Sum Games
Abstract. We consider Reinforcement Learning for average reward zerosum stochastic games. We present and analyze two algorithms. The first is based on relative Q-learning and the ...
Shie Mannor
COLT
2000
Springer
15 years 10 months ago
Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning
We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process (  ¢¡¤£¦¥§  ), and focus on gradient ascent approache...
Peter L. Bartlett, Jonathan Baxter
PIMRC
2008
IEEE
16 years 11 days ago
Iterative EM based channel estimation for KSP-OFDM
Abstract—This paper proposes a new iterative channel estimation algorithm for known symbol padding (KSP) Orthogonal Frequency Division Multiplexing (OFDM) based on the Expectatio...
Dieter Van Welden, Heidi Steendam
TSP
2010
15 years 21 days ago
Distributed consensus with quantized data via sequence averaging
The problem of distributed average consensus with quantized data is considered in this correspondence. Conventional consensus algorithms suffer from divergence when quantization er...
Jun Fang, Hongbin Li