Sciweavers

» Using Learning in a Control Agent
AAAI
2011
14 years 6 months ago
Differential Eligibility Vectors for Advantage Updating and Gradient Methods
In this paper we propose differential eligibility vectors (DEV) for temporal-difference (TD) learning, a new class of eligibility vectors designed to bring out the contribution of...
Francisco S. Melo
AAAI
2004
15 years 8 months ago
Towards Autonomic Computing: Adaptive Job Routing and Scheduling
Computer systems are rapidly becoming so complex that maintaining them with human support staffs will be prohibitively expensive and inefficient. In response, visionaries have beg...
Shimon Whiteson, Peter Stone
AAAI
1998
15 years 8 months ago
Iterated Phantom Induction: A Little Knowledge Can Go a Long Way
We advance a knowledge-based learning method that augments conventional generalization to permit concept acquisition in failure domains. These are domains in which learning must pro...
Mark Brodie, Gerald DeJong
ICML
2004
IEEE
16 years 7 months ago
Learning and discovery of predictive state representations in dynamical systems with reset
Predictive state representations (PSRs) are a recently proposed way of modeling controlled dynamical systems. PSR-based models use predictions of observable outcomes of tests that...
Michael R. James, Satinder P. Singh
IJAMC
2007
15 years 6 months ago
Navigating a 3D virtual environment of learning objects by hand gestures
This paper presents a gesture-based Human-Computer Interface (HCI) to navigate a learning object repository mapped in a 3D virtual environment. With this interface, the user can ...
Qing Chen, Abu Saleh Md. Mahfujur Rahman, Xiaojun ...