Sciweavers

265 search results - page 16 / 53
» Not Everything We Know We Learned
JUCS
2007
15 years 5 months ago
Focus of Attention in Reinforcement Learning
Abstract: Classification-based reinforcement learning (RL) methods have recently been proposed as an alternative to the traditional value-function based methods. These methods use...
Lihong Li, Vadim Bulitko, Russell Greiner
DAGSTUHL
2003
15 years 7 months ago
Toward a Cognitive System Algebra: Application to Facial Expression Learning and Imitation
In this paper, we try to demonstrate the capability of a very simple architecture to learn to recognize and reproduce facial expressions without the innate capability to recognize ...
Philippe Gaussier, Ken Prepin, Jacqueline Nadel
DIGRA
2005
Springer
15 years 11 months ago
The Nip and the Bite
An examination of the contributions that can be made by the field of non-mechanistic cybernetics (as elaborated by Gregory Bateson and Anthony Wilden) to a theory of videogames th...
Darshana Jayemanne
NIPS
1996
15 years 7 months ago
Predicting Lifetimes in Dynamically Allocated Memory
Predictions of lifetimes of dynamically allocated objects can be used to improve time and space efficiency of dynamic memory management in computer programs. Barrett and Zorn [1993] used...
David A. Cohn, Satinder P. Singh
COLT
2010
Springer
15 years 4 months ago
Optimal Algorithms for Online Convex Optimization with Multi-Point Bandit Feedback
Bandit convex optimization is a special case of online convex optimization with partial information. In this setting, a player attempts to minimize a sequence of adversarially gen...
Alekh Agarwal, Ofer Dekel, Lin Xiao