Search Sciweavers | Sciweavers

265 search results - page 16 / 53

» Not Everything We Know We Learned

164

click to vote

JUCS
2007

98views more JUCS 2007»

Focus of Attention in Reinforcement Learning

15 years 5 months ago

Download www.research.rutgers.edu

Abstract: Classiﬁcation-based reinforcement learning (RL) methods have recently been proposed as an alternative to the traditional value-function based methods. These methods use...

Lihong Li, Vadim Bulitko, Russell Greiner

claim paper

Read More »

155

click to vote

DAGSTUHL
2003

95views Software Engineering» more DAGSTUHL 2003»

Toward a Cognitive System Algebra: Application to Facial Expression Learning and Imitation

15 years 7 months ago

Download publi-etis.ensea.fr

In this paper, we try to demonstrate the capability of a very simple architecture to learn to recognize and reproduce facial expressions without the innate capability to recognize ...

Philippe Gaussier, Ken Prepin, Jacqueline Nadel

claim paper

Read More »

166

click to vote

DIGRA
2005
Springer

127views Computer Graphics» more DIGRA 2005»

The Nip and the Bite

15 years 11 months ago

Download www.digra.org

An examination of the contributions that can be made by the field of non-mechanistic cybernetics (as elaborated by Gregory Bateson and Anthony Wilden) to a theory of videogames th...

Darshana Jayemanne

claim paper

Read More »

177

click to vote

NIPS
1996

110views Information Technology» more NIPS 1996»

Predicting Lifetimes in Dynamically Allocated Memory

15 years 7 months ago

Download www.eecs.umich.edu

Predictions oflifetimesofdynamicallyallocated objects can be used to improve time and space e ciency of dynamic memory management in computer programs. Barrett and Zorn 1993] used...

David A. Cohn, Satinder P. Singh

claim paper

Read More »

209

click to vote

COLT
2010
Springer

217views Machine Learning» more COLT 2010»

Optimal Algorithms for Online Convex Optimization with Multi-Point Bandit Feedback

15 years 4 months ago

Download www.eecs.berkeley.edu

Bandit convex optimization is a special case of online convex optimization with partial information. In this setting, a player attempts to minimize a sequence of adversarially gen...

Alekh Agarwal, Ofer Dekel, Lin Xiao

claim paper

Read More »

« Prev « First page 16 / 53 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers