Sciweavers

3412 search results - page 260 / 683
» Efficient Reinforcement Learning
Sort
View
NN
2006
Springer
140views Neural Networks» more  NN 2006»
15 years 6 months ago
Neural mechanism for stochastic behaviour during a competitive game
Previous studies have shown that non-human primates can generate highly stochastic choice behaviour, especially when this is required during a competitive interaction with another...
Alireza Soltani, Daeyeol Lee, Xiao-Jing Wang
TSMC
2008
135views more  TSMC 2008»
15 years 6 months ago
Wholesale Power Price Dynamics Under Transmission Line Limits: A Use of an Agent-Based Intelligent Simulator
Abstract--This research proposes a use of an agent-based intelligent simulator to numerically examine the influence of a transmission line limit on the dynamics of a wholesale powe...
Toshiyuki Sueyoshi, Gopalakrishna Reddy Tadiparthi
AI
2002
Springer
15 years 6 months ago
Programming backgammon using self-teaching neural nets
TD-Gammon is a neural network that is able to teach itself to play backgammon solely by playing against itself and learning from the results. Starting from random initial play, TD...
Gerald Tesauro
COLT
2010
Springer
15 years 4 months ago
An Asymptotically Optimal Bandit Algorithm for Bounded Support Models
Multiarmed bandit problem is a typical example of a dilemma between exploration and exploitation in reinforcement learning. This problem is expressed as a model of a gambler playi...
Junya Honda, Akimichi Takemura
JMLR
2010
141views more  JMLR 2010»
15 years 1 months ago
Pinview: Implicit Feedback in Content-Based Image Retrieval
This paper describes Pinview, a content-based image retrieval system that exploits implicit relevance feedback during a search session. Pinview contains several novel methods that...
Peter Auer, Zakria Hussain, Samuel Kaski, Arto Kla...