Abstract— We consider the task of omnidirectional path following for a quadruped robot: moving a four-legged robot along any arbitrary path while turning in any arbitrary manner....
This paper presents a bottom-up approach that combines audio and video to simultaneously locate individual speakers in the video (2-D source localization) and segment their speech ...
Background subtraction is a widely used paradigm to detect moving objects in video taken from a static camera and is used for various important applications such as video surveill...
See www.research.microsoft.com/jojic/epitome.htm for videos, comparisons and applications. We present novel simple appearance and shape models that we call epitomes. The epitome o...
Based on perceptual and computational attention modeling studies, we formulate measures of saliency for an audiovisual stream. Audio saliency is captured by signal modulations and...