Sciweavers

1566 search results - page 103 / 314
» Planning and learning together
Sort
View
ICML
2008
IEEE
16 years 7 months ago
Sample-based learning and search with permanent and transient memories
We present a reinforcement learning architecture, Dyna-2, that encompasses both samplebased learning and sample-based search, and that generalises across states during both learni...
David Silver, Martin Müller 0003, Richard S. ...
CCE
2004
15 years 6 months ago
Pharmaceutical supply chains: key issues and strategies for optimisation
Supply chain optimisation is now a major research theme in process operations and management. A great deal of research has been undertaken on facility location and design, invento...
Nilay Shah
CHI
2006
ACM
16 years 6 months ago
The paradox of the assisted user: guidance can be counterproductive
This paper investigates the influence of interface styles on problem solving performance. It is often assumed that performance on problem solving tasks improves when users are ass...
Christof van Nimwegen, Daniel D. Burgos, Herre van...
ICRA
2002
IEEE
141views Robotics» more  ICRA 2002»
15 years 11 months ago
Movement Imitation with Nonlinear Dynamical Systems in Humanoid Robots
This article presents a new approach to movement planning, on-line trajectory modification, and imitation learning by representing movement plans based on a set of nonlinear diļ¬...
Auke Jan Ijspeert, Jun Nakanishi, Stefan Schaal
NIPS
1993
15 years 7 months ago
Using Local Trajectory Optimizers to Speed Up Global Optimization in Dynamic Programming
Dynamic programming provides a methodology to develop planners and controllers for nonlinear systems. However, general dynamic programming is computationally intractable. We have ...
Christopher G. Atkeson