Sciweavers

1834 search results - page 126 / 367
» Online Learning in Monkeys
COLT
2010
Springer
15 years 4 months ago
Optimal Algorithms for Online Convex Optimization with Multi-Point Bandit Feedback
Bandit convex optimization is a special case of online convex optimization with partial information. In this setting, a player attempts to minimize a sequence of adversarially gen...
Alekh Agarwal, Ofer Dekel, Lin Xiao
JMLR
2010
15 years 1 months ago
A Convergent Online Single Time Scale Actor Critic Algorithm
Actor-Critic based approaches were among the first to address reinforcement learning in a general setting. Recently, these algorithms have gained renewed interest due to their gen...
Dotan Di Castro, Ron Meir
WEBNET
2000
15 years 7 months ago
New Approaches to Law Education: Making the Case for Web-based Learning
Web-based instruction and online learning are changing customary practices in education. As conventional patterns for content delivery are influenced by new and improving technol...
Jennifer Gramling, Tom Galligan, Jean A. Derco
WWW
2006
ACM
16 years 7 months ago
Detecting online commercial intention (OCI)
Understanding goals and preferences behind a user's online activities can greatly help information providers, such as search engines and e-commerce web sites, to personalize c...
Honghua (Kathy) Dai, Lingzhi Zhao, Zaiqing Nie, Ji...
ICPR
2008
IEEE
16 years 26 days ago
Signature verification based on fusion of on-line and off-line kernels
The problem of signature verification is considered within the bounds of the kernel-based methodology of pattern recognition, more specifically, SVM principle of machine learning....
Vadim Mottl, Mikhail Lange, Valentina Sulimova, Al...