In this paper, we investigate the use of the coupled hidden Markov models (CHMM) for the task of audio-visual text dependent speaker identification. Our system determines the iden...
Tieyan Fu, Xiao Xing Liu, Lu Hong Liang, Xiaobo Pi...
This paper presents a system for video object generation and selective encoding with applications in surveillance, mobile videophones, and automotive industry. Object tracking and...
Alessio Del Bue, Dorin Comaniciu, Visvanathan Rame...
As sensing technologies become increasingly distributed and democratized, citizens and novice users are becoming responsible for the kinds of data collection and analysis that have...
Wesley Willett, Paul M. Aoki, Neil Kumar, Sushmita...
When exposed to environmental noise, speakers adjust their speech production to maintain intelligible communication. This phenomenon, called Lombard effect (LE), is known to consi...
The following article shows how a state-of-the-art speaker diarization system can be improved by combining traditional short-term features (MFCCs) with prosodic and other longterm...
Gerald Friedland, Oriol Vinyals, C. Yan Huang, Chr...