This paper proposes an efficient and robust background compensation method for pan-tilt-zoom cameras. The proposed method approximates the relation between consecutive images to a...
Jae Kyu Suhr, Ho Gi Jung, Gen Li, Seung-In Noh, Ja...
Abstract--In this paper, we introduce a novel approach for improved nonlinear system identification in the short-time Fourier transform (STFT) domain. We first derive explicit repr...
A real time speaker localization and detection system for videoconferencing environments is presented. In this system, a recently proposed modified Steered Response Power - Phase...
This work presents a new approach to discriminative speaker verification. Rather than estimating speaker models, or a model that discriminates between a speaker class and the cla...
The goal of the work described here is to limit the computation needed in unit selection Viterbi search for text-to-speech synthesis. The broader goal is to improve speech quality...