Professional manual transcription of speech is an expensive and time consuming process. This paper focuses on the problem of combining noisy transcriptions from multiple non-exper...
Kartik Audhkhasi, Panayiotis G. Georgiou, Shrikant...
In video coding systems using adaptive arithmetic coding to compress texture information, the employed symbol probability models need to be retrained every time the coding process...
Kenneth Vermeirsch, Joeri Barbarien, Peter Lambert...
People take more and more photos at different time and different events, however, these photos are often put into one giant folder and they are seldom annotated or organized. As t...
In this paper the authors are addressing the concerns associated with fast growing DSP chips and tools and the impact they have on teaching DSP implementation. The authors also pr...
We present an analysis of F0 range and peak alignment in emotional speech from a heterogeneous group of speakers varying in age and gender. Both speaker and emotion had a strong e...
Eric Morley, Jan P. H. van Santen, Esther Klabbers...