Sciweavers

2047 search results - page 237 / 410
» The limits of speech recognition
ICIP
2003
IEEE
16 years 8 months ago
On automatic annotation of meeting databases
In this paper, we discuss meetings as an application domain for multimedia content analysis. Meeting databases are a rich data source suitable for a variety of audio, visual and m...
Daniel Gatica-Perez, Hervé Bourlard, Iain M...
GW
2007
Springer
Biometrics
16 years 22 days ago
Enhancing a Sign Language Translation System with Vision-Based Features
In automatic sign language translation, one of the main problems is the usage of spatial information in sign language and its proper representation and translation, e.g. ...
Philippe Dreuw, Daniel Stein, Hermann Ney
ICDE
2006
IEEE
Database
16 years 19 days ago
The eNTERFACE'05 Audio-Visual Emotion Database
This paper presents an audio-visual emotion database that can be used as a reference database for testing and evaluating video, audio or joint audio-visual emotion recognition alg...
O. Martin, Irene Kotsia, Benoit M. Macq, Ioannis P...
ICMCS
2005
IEEE
Multimedia
16 years 5 days ago
Improved face finding in visually challenging environments
Finding faces in visually challenging environments is crucial to many applications, such as audio-visual automatic speech recognition, video indexing, person recognition, and vide...
Jintao Jiang, Gerasimos Potamianos, Giridharan Iye...
ICMCS
2005
IEEE
Multimedia
16 years 5 days ago
Dynamic language model adaptation using latent topical information and automatic transcripts
This paper considers dynamic language model adaptation for Mandarin broadcast news recognition. Both contemporary newswire texts and in-domain automatic transcripts were exploited...
Berlin Chen