Sciweavers

1973 search results - page 49 / 395
» Video retrieval using speech and image information
Sort
View
SIGIR
2009
ACM
16 years 24 days ago
Automatic video tagging using content redundancy
The analysis of the leading social video sharing platform YouTube reveals a high amount of redundancy, in the form of videos with overlapping or duplicated content. In this paper,...
Stefan Siersdorfer, José San Pedro, Mark Sa...
ICASSP
2011
IEEE
14 years 10 months ago
Voxel-based Viterbi Active Speaker Tracking (V-VAST) with best view selection for video lecture post-production
An automated system is presented for reducing a multi-view lecture recording into a single view video containing a best view summary of active speakers. The system uses skin color...
Damien Kelly, Anil Kokaram, Frank Boland
MM
2003
ACM
120views Multimedia» more  MM 2003»
15 years 11 months ago
Linking multimedia presentations with their symbolic source documents: algorithm and applications
An algorithm is presented that automatically matches images of presentation slides to the symbolic source file (e.g., PowerPointTM or AcrobatTM ) from which they were generated. T...
Berna Erol, Jonathan J. Hull, Dar-Shyang Lee
TMM
2008
126views more  TMM 2008»
15 years 6 months ago
Extraction of Audio Features Specific to Speech Production for Multimodal Speaker Detection
A method that exploits an information theoretic framework to extract optimized audio features using video information is presented. A simple measure of mutual information (MI) betw...
Patricia Besson, Vlad Popovici, Jean-Marc Vesin, J...
CLEF
2005
Springer
15 years 11 months ago
The Use of MedGIFT and EasyIR for ImageCLEF 2005
This article describes the use of the medGIFT and easyIR retrieval systems for three of the four ImageCLEF 2005 retrieval tasks. We participated in the ad–hoc retrieval task that...
Henning Müller, Antoine Geissbühler, Joh...