We address the task of separating music and effects from dialogs in film or television soundtracks. This is of interest for film studios wanting to release films in new, pre...
This paper presents a novel method of visual saliency detection. The use of saliency promises benefits to multimedia applications. However, so far only a few reasonable applicat...
Direction of arrival (DOA) estimation using sensor array superresolution techniques is known to suffer from array modeling errors including array element displacements, mutual co...
We propose a novel multi-stream framework for continuous conversational speech recognition which employs bidirectional Long Short-Term Memory (BLSTM) networks for phoneme predicti...
In this study we focus on the relationship between the talker-to-listener distance (TLD) and the dynamics of speech intensity and fundamental frequency. A new experiment for the ex...