The paper presents a real-time speaker identification system based on the analysis of the audio track of a video stream. The system has been employed in the context of automatic v...
Luigi P. Cordella, Pasquale Foggia, Carlo Sansone,...
Many real-world applications call for learning predictive relationships from multi-modal data. In particular, in multi-media and web applications, given a dataset of images and th...
A new computational model for active visual attention is introduced in this paper. The method extracts motion and shape features from video image sequences, and integrates these f...
This paper presents a fast and robust sprite generation algorithm for MPEG-4 video coding. Our contributions consist of two aspects. Firstly, a fast and robust Global Motion Estima...
This paper presents a system for video object generation and selective encoding with applications in surveillance, mobile videophones, and automotive industry. Object tracking and...
Alessio Del Bue, Dorin Comaniciu, Visvanathan Rame...