We present a system capable of interpreting speech commands given by a radiologist in order to accurately diagnose a set of findings and impressions for medical images, such as M...
Tim Weninger, Daniel Greene, Jack Hart, William H....
Although tagging has become increasingly popular in online image and video sharing systems, tags are known to be noisy, ambiguous, incomplete and subjective. These factors can ser...
In this paper, we present an approach for speaker change detection in broadcast video using joint audio-visual scene change statistics. Our experiments indicate that using joint a...
Multimedia Information Retrieval (IR) techniques and associated systems are now numerous enough to justify the development of strategies and actions to objectively evaluate their capabi...
A novel video retrieval tool based on MPEG-4 video object (VO) representation is presented. The algorithm extends the concept of edge potential functions (EPF), already used in sha...
Minh-Son Dao, Francesco G. B. De Natale, Andrea Ma...