Sciweavers

3317 search results - page 440 / 664
» Strategies in Rigid-Variable Methods
Sort
View
CORR
2010
Springer
105views Education» more  CORR 2010»
15 years 5 months ago
Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence
We consider model-based reinforcement learning in finite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies. Optimism is usually implemented by carryin...
Sarah Filippi, Olivier Cappé, Aurelien Gari...
FOCM
2010
161views more  FOCM 2010»
15 years 5 months ago
The Asymptotics of Wilkinson's Shift: Loss of Cubic Convergence
One of the most widely used methods for eigenvalue computation is the QR iteration with Wilkinson’s shift: here the shift s is the eigenvalue of the bottom 2 × 2 principal mino...
Ricardo S. Leite, Nicolau C. Saldanha, Carlos Tome...
ICRA
2010
IEEE
143views Robotics» more  ICRA 2010»
15 years 5 months ago
Optimal Feedback Control for anthropomorphic manipulators
— We study target reaching tasks of redundant anthropomorphic manipulators under the premise of minimal energy consumption and compliance during motion. We formulate this motor c...
Djordje Mitrovic, Sho Nagashima, Stefan Klanke, Ta...
174
Voted
JNSM
2008
104views more  JNSM 2008»
15 years 5 months ago
Call Forwarding-Based Active Probing for POTS Fault Isolation
To ensure high availability of telephone services, there is considerable interest in network management activities to develop competent fault management mechanisms. In this paper w...
Chi-Shih Chao, Maitreya Natu, Adarshpal S. Sethi
INFOCOM
2010
IEEE
15 years 5 months ago
Throughput Analysis of Multiple Access Relay Channel under Collision Model
—Despite much research on the throughput of relaying networks under idealized interference models, many practical wireless networks rely on physical-layer protocols that preclude...
Seyed A. Hejazi, Ben Liang