It has been unclear whether optimal experimental design accounts of data selection may offer insight into evidence acquisition tasks in which the learner’s beliefs change greatl...
In the multi-armed bandit (MAB) problem there are k distributions associated with the rewards of playing each of k strategies (slot machine arms). The reward distributions are ini...
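The setting described in this abstract can be made concrete with a small simulation. The sketch below is illustrative only: the excerpt is cut off before any algorithm is described, so the epsilon-greedy player used here is a standard textbook strategy swapped in for illustration, not the method of the paper. The Bernoulli reward model and every name in the code (`run_epsilon_greedy`, `arm_means`, `epsilon`) are assumptions.

```python
import random


def run_epsilon_greedy(arm_means, n_rounds=5000, epsilon=0.1, seed=0):
    """Play a k-armed Bernoulli bandit with an epsilon-greedy player.

    arm_means: success probability of each arm (hypothetical; hidden
               from the player, who only observes sampled rewards).
    Returns (pull_counts, estimated_means, total_reward).
    """
    rng = random.Random(seed)
    k = len(arm_means)
    counts = [0] * k        # number of times each arm was played
    estimates = [0.0] * k   # running average reward observed per arm
    total_reward = 0

    for _ in range(n_rounds):
        if rng.random() < epsilon:
            # Explore: choose an arm uniformly at random.
            arm = rng.randrange(k)
        else:
            # Exploit: choose the arm with the highest current estimate
            # (ties go to the lowest index).
            arm = max(range(k), key=lambda i: estimates[i])

        # Sample a 0/1 reward from the chosen arm's distribution.
        reward = 1 if rng.random() < arm_means[arm] else 0

        # Incremental update of the sample mean for that arm.
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total_reward += reward

    return counts, estimates, total_reward
```

Calling `run_epsilon_greedy([0.2, 0.5, 0.8])` returns the number of plays per arm, the player's estimate of each arm's mean reward, and the total reward collected. Raising `epsilon` spends more plays on exploration, while lowering it commits earlier to the arm that currently looks best.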
A new account of parameter setting during grammatical acquisition is presented in terms of Generalized Categorial Grammar embedded in a default inheritance hierarchy, providing a ...
This article is inspired by recent psychological studies confirming that a child is not born a blank slate but has important innate capabilities. An important part of the "le...
This paper presents a novel framework called proto-reinforcement learning (PRL), based on a mathematical model of a proto-value function: these are task-independent basis function...