Sciweavers

3509 search results - page 365 / 702
» Searching the Web by Voice
Sort
View
MM
2006
ACM
157views Multimedia» more  MM 2006»
16 years 23 days ago
Syllabic level automatic synchronization of music signals and text lyrics
We present a framework to synchronize pop music to corresponding text lyric. We refine line level alignment achievable by existing work to syllabic level by using a dynamic progra...
Denny Iskandar, Ye Wang, Min-Yen Kan, Haizhou Li
WACV
2002
IEEE
15 years 11 months ago
Automatic Detection of Signs with Affine Transformation
In this paper, we propose an approach for detecting signs from natural scenes. The approach efficiently embeds multiresolution, adaptive search, and affine rectification algorithm...
Xilin Chen, Jie Yang, Jing Zhang, Alex Waibel
ICASSP
2010
IEEE
15 years 7 months ago
Discriminative template extraction for direct modeling
This paper addresses the problem of developing appropriate features for use in direct modeling approaches to speech recognition, such as those based on Maximum Entropy models or S...
Shankar Shivappa, Patrick Nguyen, Geoffrey Zweig
ICASSP
2010
IEEE
15 years 7 months ago
Recognition of phonemes and words in singing
This paper studies the influence of n-gram language models in the recognition of sung phonemes and words. We train uni-, bi-, and trigram language models for phonemes and bi- and...
Annamaria Mesaros, Tuomas Virtanen
ICMCS
2009
IEEE
164views Multimedia» more  ICMCS 2009»
15 years 4 months ago
Audio-based classification of speaker characteristics
The human voice is primarily a carrier of speech, but it also contains non-linguistic features unique to a speaker and indicative of various speaker demographics, e.g. gender, nat...
Promiti Dutta, Alexander Haubold