Search Sciweavers | Sciweavers

4446 search results - page 435 / 890

» Learning Observer Agents

212

click to vote

AAAI
1992

128views Intelligent Agents» more AAAI 1992»

Automatic Programming of Robots Using Genetic Programming

15 years 8 months ago

Download www.genetic-programming.com

The goal in automatic programming is to get a computer to perform a task by telling it what needs to be done, rather than by explicitly programming it. This paper considers the ta...

John R. Koza, James Rice

claim paper

Read More »

193

click to vote

ECML
2007
Springer

192views Machine Learning» more ECML 2007»

Policy Gradient Critics

16 years 1 months ago

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber

claim paper

Read More »

149

Voted

ROBOCUP
2005
Springer

151views Robotics» more ROBOCUP 2005»

Sequential Pattern Mining for Situation and Behavior Prediction in Simulated Robotic Soccer

16 years 10 days ago

Download www.informatik.uni-frankfurt.de

Agents in dynamic environments have to deal with world rep- To appear in: RoboCup 2005: Robot Soccer World Cup IX, c Springer-Verlag, 2006 resentations that change over time. In or...

Andreas D. Lattner, Andrea Miene, Ubbo Visser, Ott...

claim paper

Read More »

173

Voted

DIS
2010
Springer

158views Theoretical Computer Science» more DIS 2010»

Concept Convergence in Empirical Domains

15 years 4 months ago

Download www.iiia.csic.es

How to achieve shared meaning is a significant issue when more than one intelligent agent is involved in the same domain. We define the task of concept convergence, by which intell...

Santiago Ontañón, Enric Plaza

claim paper

Read More »

198

Voted

AAAI
2006

190views Intelligent Agents» more AAAI 2006»

Action Selection in Bayesian Reinforcement Learning

15 years 8 months ago

Download www.aaai.org

My research attempts to address on-line action selection in reinforcement learning from a Bayesian perspective. The idea is to develop more effective action selection techniques b...

Tao Wang

claim paper

Read More »

« Prev « First page 435 / 890 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers