Sciweavers

1834 search results - page 103 / 367
» Online Learning in Monkeys
ICML
2008
IEEE
16 years 7 months ago
On-line discovery of temporal-difference networks
We present an algorithm for on-line, incremental discovery of temporal-difference (TD) networks. The key contribution is the establishment of three criteria to expand a node in TD...
Takaki Makino, Toshihisa Takagi
ALT
2008
Springer
16 years 3 months ago
Online Regret Bounds for Markov Decision Processes with Deterministic Transitions
We consider an upper confidence bound algorithm for Markov decision processes (MDPs) with deterministic transitions. For this algorithm we derive upper bounds on the onl...
Ronald Ortner
ICCV
2007
IEEE
16 years 21 days ago
Co-Tracking Using Semi-Supervised Support Vector Machines
This paper treats tracking as a foreground/background classification problem and proposes an online semi-supervised learning framework. Initialized with a small number of labeled ...
Feng Tang, Shane Brennan, Qi Zhao, Hai Tao
OSDI
2008
ACM
16 years 6 months ago
From Optimization to Regret Minimization and Back Again
Internet routing is mostly based on static information; its dynamicity is limited to reacting to changes in topology. Adaptive performance-based routing decisions would not o...
Ioannis C. Avramopoulos, Jennifer Rexford, Robert ...
IUI
2006
ACM
16 years 10 days ago
Are two talking heads better than one?: when should use more than one agent in e-learning?
Recent interest in the use of software character agents raises the issue of how many agents should be used in online learning. In this paper we review evidence concerning the rela...
Hua Wang, Mark H. Chignell, Mitsuru Ishizuka