We study key issues related to multilingual acoustic modeling for automatic speech recognition (ASR) through a series of large-scale ASR experiments. Our study explores shared str...
Hui Lin, Li Deng, Dong Yu, Yifan Gong, Alex Acero,...
The following article shows how a state-of-the-art speaker diarization system can be improved by combining traditional short-term features (MFCCs) with prosodic and other longterm...
Gerald Friedland, Oriol Vinyals, C. Yan Huang, Chr...
Chroma-based audio features are a well-established tool for analyzing and comparing music data. By identifying spectral components that differ by a musical octave, chroma features...
Local features are widely used for content-based image retrieval and object recognition. We present an efficient method for encoding digital images suitable for local feature extr...
Mina Makar, Chuo-Ling Chang, David M. Chen, Sam S....
Character prototype approaches for writer identification produces a consistent set of templates that are used to model the handwriting styles of writers, thereby allowing high acc...