Search Sciweavers | Sciweavers

1630 search results - page 204 / 326

» Coordinated Reinforcement Learning

182

click to vote

INTERSPEECH
2010

175views Signal Processing» more INTERSPEECH 2010»

Still talking to machines (cognitively speaking)

15 years 1 months ago

Download mi.eng.cam.ac.uk

This overview article reviews the structure of a fully statistical spoken dialogue system (SDS), using as illustration, various systems and components built at Cambridge over the ...

Steve Young

claim paper

Read More »

208

click to vote

JMLR
2010

189views more JMLR 2010»

Adaptive Step-size Policy Gradients with Average Reward Metric

15 years 1 months ago

Download jmlr.csail.mit.edu

In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...

Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...

claim paper

Read More »

161

click to vote

ICML
1998
IEEE

165views Machine Learning» more ICML 1998»

Intra-Option Learning about Temporally Abstract Actions

16 years 7 months ago

Download www.cs.ualberta.ca

tion Learning about Temporally Abstract Actions Richard S. Sutton Department of Computer Science University of Massachusetts Amherst, MA 01003-4610 rich@cs.umass.edu Doina Precup D...

Richard S. Sutton, Doina Precup, Satinder P. Singh

claim paper

Read More »

168

click to vote

IJCAI
2001

141views Artificial Intelligence» more IJCAI 2001»

Robot Weightlifting By Direct Policy Search

15 years 7 months ago

Download reference.kfupm.edu.sa

This paper describes a method for structuring a robot motor learning task. By designing a suitably parameterized policy, we show that a simple search algorithm, along with biologi...

Michael T. Rosenstein, Andrew G. Barto

claim paper

Read More »

168

click to vote

GECCO
2005
Springer

139views Optimization» more GECCO 2005»

Event-driven learning classifier systems for online soccer games

15 years 12 months ago

Download www.genetic-programming.org

This paper reports on the application of classifier systems to the acquisition of decision-making algorithms for agents in online soccer games. The objective of this research is t...

Yuji Sato, Ryutaro Kanno

claim paper

Read More »

« Prev « First page 204 / 326 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers