Search Sciweavers | Sciweavers

3412 search results - page 260 / 683

» Efficient Reinforcement Learning

171

click to vote

NN
2006
Springer

140views Neural Networks» more NN 2006»

Neural mechanism for stochastic behaviour during a competitive game

15 years 6 months ago

Download wanglab.med.yale.edu

Previous studies have shown that non-human primates can generate highly stochastic choice behaviour, especially when this is required during a competitive interaction with another...

Alireza Soltani, Daeyeol Lee, Xiao-Jing Wang

claim paper

Read More »

181

click to vote

TSMC
2008

135views more TSMC 2008»

Wholesale Power Price Dynamics Under Transmission Line Limits: A Use of an Agent-Based Intelligent Simulator

15 years 6 months ago

Download www.icasa.nmt.edu

Abstract--This research proposes a use of an agent-based intelligent simulator to numerically examine the influence of a transmission line limit on the dynamics of a wholesale powe...

Toshiyuki Sueyoshi, Gopalakrishna Reddy Tadiparthi

claim paper

Read More »

164

click to vote

AI
2002
Springer

117views Artificial Intelligence» more AI 2002»

Programming backgammon using self-teaching neural nets

15 years 6 months ago

Download www.math-info.univ-paris5.fr

TD-Gammon is a neural network that is able to teach itself to play backgammon solely by playing against itself and learning from the results. Starting from random initial play, TD...

Gerald Tesauro

claim paper

Read More »

185

click to vote

COLT
2010
Springer

207views Machine Learning» more COLT 2010»

An Asymptotically Optimal Bandit Algorithm for Bounded Support Models

15 years 4 months ago

Download www.colt2010.org

Multiarmed bandit problem is a typical example of a dilemma between exploration and exploitation in reinforcement learning. This problem is expressed as a model of a gambler playi...

Junya Honda, Akimichi Takemura

claim paper

Read More »

212

click to vote

JMLR
2010

141views more JMLR 2010»

Pinview: Implicit Feedback in Content-Based Image Retrieval

15 years 1 months ago

Download jmlr.csail.mit.edu

This paper describes Pinview, a content-based image retrieval system that exploits implicit relevance feedback during a search session. Pinview contains several novel methods that...

Peter Auer, Zakria Hussain, Samuel Kaski, Arto Kla...

claim paper

Read More »

« Prev « First page 260 / 683 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers