Sciweavers

5647 search results - page 333 / 1130
» A word from the editor
Sort
View
ACCV
2010
Springer
15 years 1 months ago
Optimizing Visual Vocabularies Using Soft Assignment Entropies
The state of the art for large database object retrieval in images is based on quantizing descriptors of interest points into visual words. High similarity between matching image r...
Yubin Kuang, Kalle Åström, Lars Kopp, M...
INTERSPEECH
2010
15 years 1 months ago
FSM-based pronunciation modeling using articulatory phonological code
According to articulatory phonology, the gestural score is an invariant speech representation. Though the timing schemes, i.e., the onsets and offsets, of the gestural activations...
Chi Hu, Xiaodan Zhuang, Mark Hasegawa-Johnson
ICASSP
2011
IEEE
14 years 10 months ago
Enriching Mandarin speech recognition by incorporating a hierarchical prosody model
This paper presents a new probabilistic framework of Mandarin speech recognition by incorporating a sophisticated hierarchical prosody model into the conventional HMM-based system...
Jyh-Her Yang, Ming-Chieh Liu, Hao-Hsiang Chang, Ch...
ICCV
2011
IEEE
14 years 6 months ago
End-to-end Scene Text Recognition
This paper focuses on the problem of word detection and recognition in natural images. The problem is significantly more challenging than reading text in scanned documents, and h...
Kai Wang, Boris Babenko, Serge Belongie
CRV
2011
IEEE
278views Robotics» more  CRV 2011»
14 years 6 months ago
Online Visual Vocabularies
Abstract—The idea of an online visual vocabulary is proposed. In contrast to the accepted strategy of generating vocabularies offline, using the k-means clustering over all the ...
Yogesh A. Girdhar, Gregory Dudek