Sciweavers

515 search results - page 54 / 103
» Approximating Markov Processes by Averaging
Sort
View
CORR
2010
Springer
171views Education» more  CORR 2010»
15 years 1 months ago
Online Learning in Opportunistic Spectrum Access: A Restless Bandit Approach
We consider an opportunistic spectrum access (OSA) problem where the time-varying condition of each channel (e.g., as a result of random fading or certain primary users' activ...
Cem Tekin, Mingyan Liu
GLOBECOM
2006
IEEE
16 years 4 days ago
Feedback Capacity of Stationary Sources over Gaussian Intersymbol Interference Channels
Abstract— We consider discrete-time channels with finitelength intersymbol interference and additive Gaussian noise. The channel noise is considered to be a stationary ARMA (aut...
Shaohua Yang, Aleksandar Kavcic, Sekhar Tatikonda
CORR
2007
Springer
94views Education» more  CORR 2007»
15 years 6 months ago
Paging and Registration in Cellular Networks: Jointly Optimal Policies and an Iterative Algorithm
— This paper explores optimization of paging and registration policies in cellular networks. Motion is modeled as a discrete-time Markov process, and minimization of the discount...
Bruce Hajek, Kevin Mitzel, Sichao Yang
ICML
2007
IEEE
16 years 7 months ago
Automatic shaping and decomposition of reward functions
This paper investigates the problem of automatically learning how to restructure the reward function of a Markov decision process so as to speed up reinforcement learning. We begi...
Bhaskara Marthi
AAAI
2006
15 years 7 months ago
Point-based Dynamic Programming for DEC-POMDPs
We introduce point-based dynamic programming (DP) for decentralized partially observable Markov decision processes (DEC-POMDPs), a new discrete DP algorithm for planning strategie...
Daniel Szer, François Charpillet