In the multi-armed bandit (MAB) problem there are k distributions associated with the rewards of playing each of k strategies (slot machine arms). The reward distributions are ini...
We connect two scenarios in structured learning: adapting a parser trained on one corpus to another annotation style, and projecting syntactic annotations from one language to ano...
Abstract: Ambient assisted living (AAL) is a newly emerging term describing a research area with focus on services that support people in their daily life with particular focus on ...
Reusing or modifying known design concepts cannot meet new challenges for engineering systems. However, engineers can find inspiration outside their traditional domains in order to...
We present a novel technique for automated problem decomposition to address the problem of scalability in reinforcement learning. Our technique makes use of a set of near-optimal ...
Peng Zang, Peng Zhou, David Minnen, Charles Lee Is...