Sciweavers

2951 search results - page 334 / 591
» Trustable Task Processing Systems
ICASSP
2009
IEEE
16 years 1 month ago
Multi-modal speaker diarization of real-world meetings using compressed-domain video features
Speaker diarization is originally defined as the task of determining “who spoke when” given an audio track and no other prior knowledge of any kind. The following article sho...
Gerald Friedland, Hayley Hung, Chuohao Yeo
ICASSP
2008
IEEE
16 years 1 month ago
Modulation forensics for wireless digital communications
Modulation forensics is to detect the modulation type in wireless communications without any prior information. It finds both military and civilian applications such as surveillance...
W. Sabrina Lin, K. J. Ray Liu
ICASSP
2008
IEEE
16 years 1 month ago
Audio cover song identification based on tonal sequence alignment
Nowadays, the term cover song (or simply cover) can mean any new version, performance, rendition, or recording of a previously recorded track. Cover song identification is a task...
Joan Serrà, Emilia Gómez
ICASSP
2008
IEEE
16 years 1 month ago
Multimodal information fusion using the iterative decoding algorithm and its application to audio-visual speech recognition
The fusion of information from heterogeneous sensors is crucial to the effectiveness of a multimodal system. Noise affects the sensors of different modalities independently. A good ...
Shankar T. Shivappa, Bhaskar D. Rao, Mohan M. Triv...
ICASSP
2008
IEEE
16 years 1 month ago
Confidence scores for acoustic model adaptation
This paper focuses on confidence scores for use in acoustic model adaptation. Frame-based confidence estimates are used in linear transform (CMLLR and MLLR) and MAP adaptation. ...
Christian Gollan, Michiel Bacchiani