Search Sciweavers | Sciweavers

176

NIPS
2003

119views Information Technology» more NIPS 2003»

All learning is Local: Multi-agent Learning in Global Reward Games

15 years 8 months ago

In large multiagent games, partial observability, coordination, and credit assignment persistently plague attempts to design good learning algorithms. We provide a simple and ef�...

Yu-Han Chang, Tracey Ho, Leslie Pack Kaelbling

claim paper

Read More »

164

click to vote

NIPS
2003

152views Information Technology» more NIPS 2003»

Learning Near-Pareto-Optimal Conventions in Polynomial Time

15 years 8 months ago

Download www-2.cs.cmu.edu

We study how to learn to play a Pareto-optimal strict Nash equilibrium when there exist multiple equilibria and agents may have different preferences among the equilibria. We focu...

Xiao Feng Wang, Tuomas Sandholm

claim paper

Read More »

171

click to vote

ACSC
2007
IEEE

111views Theoretical Computer Science» more ACSC 2007»

Mutually Visible Agents in a Discrete Environment

16 years 1 months ago

Download crpit.com

As computer controlled entities are set to move and explore more complex environments they need to be able to perform navigation tasks, like ﬁnding minimal cost routes. Much wor...

Joel Fenwick, Vladimir Estivill-Castro

claim paper

Read More »

171

click to vote

AUSAI
2003
Springer

177views Artificial Intelligence» more AUSAI 2003»

BN+BN: Behavior Network with Bayesian Network for Intelligent Agent

16 years 3 hour ago

Download sclab.yonsei.ac.kr

Abstract. In the philosophy of behavior-based robotics, design of complex behavior needs the interaction of basic behaviors that are easily implemented. Action selection mechanism ...

Kyung-Joong Kim, Sung-Bae Cho

claim paper

Read More »

193

click to vote

ANSS
2002
IEEE

147views Modeling and Simulation» more ANSS 2002»

Temporal Uncertainty Time Warp: An Agent-Based Implementation

15 years 11 months ago

Download www.dis.uniroma1.it

This paper introduces TUTW – Temporal Uncertainty Time Warp – a control engine designed for an exploitation of temporal uncertainty (TU) in general optimistic simulations, and...

Roberto Beraldi, Libero Nigro, Antonino Orlando, F...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers