Sciweavers

1630 search results - page 204 / 326
» Coordinated Reinforcement Learning
INTERSPEECH
2010
15 years 1 month ago
Still talking to machines (cognitively speaking)
This overview article reviews the structure of a fully statistical spoken dialogue system (SDS), using as illustration, various systems and components built at Cambridge over the ...
Steve Young
JMLR
2010
15 years 1 month ago
Adaptive Step-size Policy Gradients with Average Reward Metric
In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...
Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...
ICML
1998
IEEE
16 years 7 months ago
Intra-Option Learning about Temporally Abstract Actions
Intra-Option Learning about Temporally Abstract Actions. Richard S. Sutton, Department of Computer Science, University of Massachusetts, Amherst, MA 01003-4610, rich@cs.umass.edu; Doina Precup, D...
Richard S. Sutton, Doina Precup, Satinder P. Singh
IJCAI
2001
15 years 7 months ago
Robot Weightlifting By Direct Policy Search
This paper describes a method for structuring a robot motor learning task. By designing a suitably parameterized policy, we show that a simple search algorithm, along with biologi...
Michael T. Rosenstein, Andrew G. Barto
GECCO
2005
Springer
Optimization
15 years 12 months ago
Event-driven learning classifier systems for online soccer games
This paper reports on the application of classifier systems to the acquisition of decision-making algorithms for agents in online soccer games. The objective of this research is t...
Yuji Sato, Ryutaro Kanno