Sciweavers

4446 search results - page 435 / 890
» Learning Observer Agents
Sort
View
AAAI
1992
15 years 8 months ago
Automatic Programming of Robots Using Genetic Programming
The goal in automatic programming is to get a computer to perform a task by telling it what needs to be done, rather than by explicitly programming it. This paper considers the ta...
John R. Koza, James Rice
ECML
2007
Springer
16 years 1 months ago
Policy Gradient Critics
We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...
Daan Wierstra, Jürgen Schmidhuber
149
Voted
ROBOCUP
2005
Springer
151views Robotics» more  ROBOCUP 2005»
16 years 10 days ago
Sequential Pattern Mining for Situation and Behavior Prediction in Simulated Robotic Soccer
Agents in dynamic environments have to deal with world rep- To appear in: RoboCup 2005: Robot Soccer World Cup IX, c Springer-Verlag, 2006 resentations that change over time. In or...
Andreas D. Lattner, Andrea Miene, Ubbo Visser, Ott...
173
Voted
DIS
2010
Springer
15 years 4 months ago
Concept Convergence in Empirical Domains
How to achieve shared meaning is a significant issue when more than one intelligent agent is involved in the same domain. We define the task of concept convergence, by which intell...
Santiago Ontañón, Enric Plaza
198
Voted
AAAI
2006
15 years 8 months ago
Action Selection in Bayesian Reinforcement Learning
My research attempts to address on-line action selection in reinforcement learning from a Bayesian perspective. The idea is to develop more effective action selection techniques b...
Tao Wang