We provide an analytical comparison between discounted and average reward temporal-difference (TD) learning with linearly parameterized approximations. We first consider the asympt...
Children who are active on the internet are performing significant design and programming activity without realising it, in the course of hacking little animations, game scripts a...
: In this paper we analyse the experience gained in the 2002 and 2003 UK e-voting pilots in the implementation of the e-electoral register of voters. After theoretically establishi...
Multi-agent teamwork is critical in a large number of agent applications, including training, education, virtual enterprises and collective robotics. Tools that can help humans an...
Knowledge transfer between expert and novice agents is a challenging problem given that the knowledge representation and learning algorithms used by the novice learner can be fund...