Sciweavers

1630 search results - page 143 / 326
» Coordinated Reinforcement Learning
AAAI
1996
15 years 7 months ago
Evolution-Based Discovery of Hierarchical Behaviors
Procedural representations of control policies have two advantages when facing the scale-up problem in learning tasks. First they are implicit, with potential for inductive genera...
Justinian P. Rosca, Dana H. Ballard
SOCROB
2010
15 years 4 months ago
Using the Interaction Rhythm as a Natural Reinforcement Signal for Social Robots: A Matter of Belief
Abstract. In this paper, we present the results of a pilot study of a human robot interaction experiment where the rhythm of the interaction is used as a reinforcement signal to le...
Antoine Hiolle, Lola Cañamero, Pierre Andry...
ICRA
2009
IEEE
16 years 1 months ago
Least absolute policy iteration for robust value function approximation
Abstract: Least-squares policy iteration is a useful reinforcement learning method in robotics due to its computational efficiency. However, it tends to be sensitive to outliers...
Masashi Sugiyama, Hirotaka Hachiya, Hisashi Kashim...
COLING
2000
15 years 7 months ago
Automatic Optimization of Dialogue Management
Designing the dialogue strategy of a spoken dialogue system involves many nontrivial choices. This paper presents a reinforcement learning approach for automatically optimizing di...
Diane J. Litman, Michael S. Kearns, Satinder P. Si...
AAMAS
1999
Springer
15 years 6 months ago
Learning Situation-Specific Coordination in Cooperative Multi-agent Systems
Achieving effective cooperation in a multi-agent system is a difficult problem for a number of reasons such as limited and possibly out-dated views of activities of other agents and ...
M. V. Nagendra Prasad, Victor R. Lesser