Sciweavers

683 search results - page 112 / 137
» Coarticulation in Markov Decision Processes
Sort
View
INFOCOM
2009
IEEE
16 years 22 days ago
Network Bandwidth Allocation via Distributed Auctions with Time Reservations
—This paper studies the problem of allocating network capacity through periodic auctions. Motivated primarily by a service overlay architecture, we impose the following condition...
Pablo Belzarena, Andrés Ferragut, Fernando ...
ATAL
2009
Springer
16 years 20 days ago
SarsaLandmark: an algorithm for learning in POMDPs with landmarks
Reinforcement learning algorithms that use eligibility traces, such as Sarsa(λ), have been empirically shown to be effective in learning good estimated-state-based policies in pa...
Michael R. James, Satinder P. Singh
CDC
2008
IEEE
117views Control Systems» more  CDC 2008»
16 years 17 days ago
Event-based optimization for dispatching policies in material handling systems of general assembly lines
—A material handling (MH) system of a general assembly line dispatching parts from inventory to working buffers could be complicated and costly to operate. Generally it is extrem...
Yanjia Zhao, Qianchuan Zhao, Qing-Shan Jia, Xiaoho...
CDC
2008
IEEE
197views Control Systems» more  CDC 2008»
16 years 17 days ago
Dynamic spectrum access policies for cognitive radio
—We study the problem of dynamic spectrum sensing and access in cognitive radio systems as a partially observed Markov decision process (POMDP). A group of cognitive users cooper...
Jayakrishnan Unnikrishnan, Venugopal V. Veeravalli
CDC
2008
IEEE
204views Control Systems» more  CDC 2008»
16 years 17 days ago
Dynamic ping optimization for surveillance in multistatic sonar buoy networks with energy constraints
— In this paper we study the problem of dynamic optimization of ping schedule in an active sonar buoy network deployed to provide persistent surveillance of a littoral area throu...
Anshu Saksena, I-Jeng Wang