Sciweavers

187 search results - page 21 / 38
» Imitation and Reinforcement Learning in Agents with Heteroge...
Sort
View
ICANN
2001
Springer
15 years 10 months ago
Market-Based Reinforcement Learning in Partially Observable Worlds
Unlike traditional reinforcement learning (RL), market-based RL is in principle applicable to worlds described by partially observable Markov Decision Processes (POMDPs), where an ...
Ivo Kwee, Marcus Hutter, Jürgen Schmidhuber
ATAL
2009
Springer
16 years 20 days ago
Solving multiagent assignment Markov decision processes
We consider the setting of multiple collaborative agents trying to complete a set of tasks as assigned by a centralized controller. We propose a scalable method called“Assignmen...
Scott Proper, Prasad Tadepalli
GECCO
2009
Springer
200views Optimization» more  GECCO 2009»
16 years 19 days ago
Apply ant colony optimization to Tetris
Tetris is a falling block game where the player’s objective is to arrange a sequence of different shaped tetrominoes smoothly in order to survive. In the intelligence games, ag...
Xingguo Chen, Hao Wang, Weiwei Wang, Yinghuan Shi,...
PRIMA
2009
Springer
16 years 19 days ago
Recursive Adaptation of Stepsize Parameter for Non-stationary Environments
In this article, we propose a method to adapt stepsize parameters used in reinforcement learning for dynamic environments. In general reinforcement learning situations, a stepsize...
Itsuki Noda
ISADS
1999
IEEE
15 years 10 months ago
Emergence of Communication for Negotiation by a Recurrent Neural Network
We believe that communication in multi-agent system has two major meanings. One of them is to transmit one agent's observed information to the other. The other meaning is to ...
Katsunari Shibata, Koji Ito