Sciweavers

373 search results - page 44 / 75
» Covariant Policy Search
TSP
2010
15 years 23 days ago
Distributed learning in multi-armed bandit with multiple players
We formulate and study a decentralized multi-armed bandit (MAB) problem. There are distributed players competing for independent arms. Each arm, when played, offers i.i.d. reward a...
Keqin Liu, Qing Zhao
CDC
2010
IEEE
14 years 9 months ago
Persistent patrol with limited-range on-board sensors
We propose and analyze the Persistent Patrol Problem (PPP). An unmanned aerial vehicle (UAV) moving with constant speed and unbounded acceleration patrols a bounded region of t...
Vu Anh Huynh, John Enright, Emilio Frazzoli
DSN
2008
IEEE
16 years 17 days ago
Scheduling algorithms for unpredictably heterogeneous CMP architectures
In future large-scale multi-core microprocessors, hard errors and process variations will create dynamic heterogeneity, causing performance and power characteristics to differ amo...
Jonathan A. Winter, David H. Albonesi
CSCLP
2006
Springer
15 years 9 months ago
Cost-Based Filtering for Stochastic Inventory Control
An interesting class of production/inventory control problems considers a single product and a single stocking location, given a stochastic demand with a known non-statio...
Armagan Tarim, Brahim Hnich, Roberto Rossi, Steven...
PPL
2008
15 years 6 months ago
Using Hardware Multithreading to Overcome Broadcast/Reduction Latency in an Associative SIMD Processor
The latency of broadcast/reduction operations has a significant impact on the performance of SIMD processors. This is especially true for associative programs, which make extensiv...
Kevin Schaffer, Robert A. Walker