Discriminative confidence estimation along with confidence normalisation has been shown to construct robust decision maker modules in spoken term detection (STD) systems. Discrim...
Javier Tejedor, Doroteo Torre Toledano, Miguel Bau...
In this letter, we consider multiple-input single-output (MISO) systems with two-way training-based transmission. We focus on the long-term system performance and study th...
Xiangyun Zhou, Tharaka A. Lamahewa, Parastoo Sadeg...
Monaural speech separation is a very challenging task. CASA-based systems utilize acoustic features to produce a time-frequency (T-F) mask. In this study, we propose a classificat...
It has been shown repeatedly that iterative relevance feedback is a very efficient solution for content-based image retrieval. However, no existing system scales gracefully to hu...
Music transcription refers to the extraction of a human-readable and interpretable description from a recording of a music performance. Automatic music transcription remains, nowadays...
Marco Paleari, Benoit Huet, Antony Schutz, Dirk T....