Sciweavers

11539 search results - page 350 / 2308
» On Learning from Exercises
Sort
View
ROBOCUP
2009
Springer
134views Robotics» more  ROBOCUP 2009»
16 years 1 months ago
Learning Complementary Multiagent Behaviors: A Case Study
As the reach of multiagent reinforcement learning extends to more and more complex tasks, it is likely that the diverse challenges posed by some of these tasks can only be address...
Shivaram Kalyanakrishnan, Peter Stone
ICML
2003
IEEE
16 years 7 months ago
AWESOME: A General Multiagent Learning Algorithm that Converges in Self-Play and Learns a Best Response Against Stationary Oppon
A satisfactory multiagent learning algorithm should, at a minimum, learn to play optimally against stationary opponents and converge to a Nash equilibrium in self-play. The algori...
Vincent Conitzer, Tuomas Sandholm
ESANN
2008
15 years 8 months ago
Learning to play Tetris applying reinforcement learning methods
In this paper the application of reinforcement learning to Tetris is investigated, particulary the idea of temporal difference learning is applied to estimate the state value funct...
Alexander Groß, Jan Friedland, Friedhelm Sch...
TAPSOFT
1995
Springer
15 years 10 months ago
Anatomy of the Pentium Bug
The Pentium computer chip’s division algorithm relies on a table from which five entries were inadvertently omitted, with the result that 1738 single precision dividenddivisor ...
Vaughan R. Pratt
BMCBI
2004
211views more  BMCBI 2004»
15 years 6 months ago
GenomeViz: visualizing microbial genomes
Background: An increasing number of microbial genomes are being sequenced and deposited in public databases. In addition, several closely related strains are also being sequenced ...
Rohit Ghai, Torsten Hain, Trinad Chakraborty