Search Sciweavers | Sciweavers

193

NIPS
2003

207views Information Technology» more NIPS 2003»

Extending Q-Learning to General Adaptive Multi-Agent Systems

15 years 8 months ago

Recent multi-agent extensions of Q-Learning require knowledge of other agents’ payoffs and Q-functions, and assume game-theoretic play at all times by all other agents. This pap...

Gerald Tesauro

claim paper

Read More »

189

click to vote

OPODIS
2003

133views Distributed And Parallel Com...» more OPODIS 2003»

Linear Time Byzantine Self-Stabilizing Clock Synchronization

15 years 8 months ago

Download www.cs.huji.ac.il

Awareness of the need for robustness in distributed systems increases as distributed systems become an integral part of day-to-day systems. Tolerating Byzantine faults and possessi...

Ariel Daliot, Danny Dolev, Hanna Parnas

claim paper

Read More »

163

click to vote

AAAI
1998

175views Intelligent Agents» more AAAI 1998»

Bayesian Q-Learning

15 years 8 months ago

Download www.aaai.org

A central problem in learning in complex environmentsis balancing exploration of untested actions against exploitation of actions that are known to be good. The benefit of explora...

Richard Dearden, Nir Friedman, Stuart J. Russell

claim paper

Read More »

176

click to vote

AAAI
2000

139views Intelligent Agents» more AAAI 2000»

Localizing Search in Reinforcement Learning

15 years 8 months ago

Download www.cs.colorado.edu

Reinforcement learning (RL) can be impractical for many high dimensional problems because of the computational cost of doing stochastic search in large state spaces. We propose a ...

Gregory Z. Grudic, Lyle H. Ungar

claim paper

Read More »

178

click to vote

AAAI
1998

132views Intelligent Agents» more AAAI 1998»

Learning to Classify Text from Labeled and Unlabeled Documents

15 years 8 months ago

Download www.kamalnigam.com

In many important text classification problems, acquiring class labels for training documents is costly, while gathering large quantities of unlabeled data is cheap. This paper sh...

Kamal Nigam, Andrew McCallum, Sebastian Thrun, Tom...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers