Text-to-speech synthesizer systems are of overall good quality, especially when adapted to a specific task. Given this task and an adapted voice corpus, the message quality is mai...
This paper investigates the use of phoneme class conditional probabilities as features (posterior features) for template-based ASR. Using 75 words and 600 words task-independent a...
Serena Soldo, Mathew Magimai-Doss, Joel Pinto, Her...
A real time speaker localization and detection system for videoconferencing environments is presented. In this system, a recently proposed modified Steered Response Power - Phase...
Super-resolution is the task of creating an high resolution image from a low resolution input sequence. To overcome the difficulties of fine image registration, several methods ...
Multicast is a central challenge for emerging multi-hop wireless architectures such as wireless mesh networks, because of its substantial cost in terms of bandwidth. In this artic...