Search Sciweavers | Sciweavers

3808 search results - page 573 / 762

» Artificial Intelligence: A Modern Approach

168

click to vote

ATAL
2008
Springer

124views Intelligent Agents» more ATAL 2008»

Social reward shaping in the prisoner's dilemma

15 years 8 months ago

Download www.aamas-conference.org

Reward shaping is a well-known technique applied to help reinforcement-learning agents converge more quickly to nearoptimal behavior. In this paper, we introduce social reward sha...

Monica Babes, Enrique Munoz de Cote, Michael L. Li...

claim paper

Read More »

177

click to vote

ATAL
2008
Springer

156views Intelligent Agents» more ATAL 2008»

Teaching multi-robot coordination using demonstration of communication and state sharing

15 years 8 months ago

Download www.cs.cmu.edu

Solutions to complex tasks often require the cooperation of multiple robots, however, developing multi-robot policies can present many challenges. In this work, we introduce teach...

Sonia Chernova, Manuela M. Veloso

claim paper

Read More »

171

click to vote

ATAL
2008
Springer

101views Intelligent Agents» more ATAL 2008»

Optimized algorithms for multi-agent routing

15 years 8 months ago

Download www.aamas-conference.org

Auction methods have been successfully used for coordinating teams of robots in the multi-robot routing problem, a representative domain for multi-agent coordination. Solutions to...

Akihiro Kishimoto, Nathan R. Sturtevant

claim paper

Read More »

204

click to vote

ATAL
2006
Springer

160views Intelligent Agents» more ATAL 2006»

Selecting informative actions improves cooperative multiagent learning

15 years 8 months ago

Download cs.gmu.edu

In concurrent cooperative multiagent learning, each agent simultaneously learns to improve the overall performance of the team, with no direct control over the actions chosen by i...

Liviu Panait, Sean Luke

claim paper

Read More »

166

click to vote

ATAL
2010
Springer

141views Intelligent Agents» more ATAL 2010»

Risk-sensitive planning in partially observable environments

15 years 7 months ago

Download www.aamas-conference.org

Partially Observable Markov Decision Process (POMDP) is a popular framework for planning under uncertainty in partially observable domains. Yet, the POMDP model is riskneutral in ...

Janusz Marecki, Pradeep Varakantham

claim paper

Read More »

« Prev « First page 573 / 762 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers