Sciweavers

4446 search results - page 453 / 890
» Learning Observer Agents
ML
2002
ACM
168 views, Machine Learning
15 years 6 months ago
On Average Versus Discounted Reward Temporal-Difference Learning
We provide an analytical comparison between discounted and average reward temporal-difference (TD) learning with linearly parameterized approximations. We first consider the asympt...
John N. Tsitsiklis, Benjamin Van Roy
VL
2007
IEEE
122 views, Visual Languages
16 years 1 months ago
Children as Unwitting End-User Programmers
Children who are active on the internet are performing significant design and programming activity without realising it, in the course of hacking little animations, game scripts a...
Marian Petre, Alan F. Blackwell
EVOTING
2004
74 views, Hardware
15 years 8 months ago
The UK Deployment of the E-Electoral Register
In this paper we analyse the experience gained in the 2002 and 2003 UK e-voting pilots in the implementation of the e-electoral register of voters. After theoretically establishi...
Alexandros Xenakis, Ann Macintosh
AGENTS
2000
Springer
15 years 11 months ago
Automated assistants to aid humans in understanding team behaviors
Multi-agent teamwork is critical in a large number of agent applications, including training, education, virtual enterprises and collective robotics. Tools that can help humans an...
Taylor Raines, Milind Tambe, Stacy Marsella
ATAL
2006
Springer
15 years 10 months ago
Teaching new teammates
Knowledge transfer between expert and novice agents is a challenging problem given that the knowledge representation and learning algorithms used by the novice learner can be fund...
Doran Chakraborty, Sandip Sen