Sciweavers

4035 search results - page 284 / 807
» Useless Actions Are Useful
GLOBECOM
2007
IEEE
16 years 1 month ago
Constrained Stochastic Games in Wireless Networks
We consider the situation where N nodes share a common access point. With each node i there is an associated buffer and channel state that change in time. Node i dynamically cho...
Eitan Altman, Konstantin Avrachenkov, Nicolas Bon...
ICRA
2007
IEEE
16 years 29 days ago
Adaptive Play Q-Learning with Initial Heuristic Approximation
The problem of an effective coordination of multiple autonomous robots is one of the most important tasks of the modern robotics. In turn, it is well known that the lea...
Andriy Burkov, Brahim Chaib-draa
ECAL
2007
Springer
16 years 25 days ago
From Solitary to Collective Behaviours: Decision Making and Cooperation
In a social scenario, establishing whether a collaboration is required to achieve a certain goal is a complex problem that requires decision making capabilities and coordination am...
Vito Trianni, Christos Ampatzis, Anders Lyhne Chri...
AVSS
2006
IEEE
16 years 22 days ago
Learning Foveal Sensing Strategies in Unconstrained Surveillance Environments
In this paper we report on techniques for automatically learning foveal sensing strategies for an active pan-tilt-zoom camera. The approach uses reinforcement learning to discover ...
Andrew D. Bagdanov, Alberto Del Bimbo, Walter Nunz...
SAC
2005
ACM
16 years 7 days ago
Reinforcement learning agents with primary knowledge designed by analytic hierarchy process
This paper presents a novel model of reinforcement learning agents. A feature of our learning agent model is to integrate analytic hierarchy process (AHP) into a standard reinforc...
Kengo Katayama, Takahiro Koshiishi, Hiroyuki Narih...