Sciweavers

4446 search results - page 45 / 890
» Learning Observer Agents
Sort
View
ATAL
2010
Springer
15 years 1 months ago
Self-organisation in an agent network via learning
Dayong Ye, Minjie Zhang, Danny Sutanto
IJCAI
2001
15 years 7 months ago
R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning
R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complet...
Ronen I. Brafman, Moshe Tennenholtz