We describe a flexible model for representing images of objects of a certain class, known a priori, such as faces, and introduce a new algorithm for matching it to a novel image and...
Abstract. Movies and TV are a rich source of diverse and complex video of people, objects, actions and locales "in the wild". Harvesting automatically labeled sequences o...
Timothee Cour, Chris Jordan, Eleni Miltsakaki, Ben...
We propose a method for measuring the quality of a grouping result, based on the following observation: a better grouping result provides more information about the true, unknown g...
Erik A. Engbers, Michael Lindenbaum, Arnold W. M. ...
Abstract. A novel algorithm is presented for the 3D reconstruction of human action in long (> 30 second) monocular image sequences. A sequence is represented by a small set of au...
Gareth Loy, Martin Eriksson, Josephine Sullivan, S...
Abstract. We present a new approach to modeling and processing multimedia data. This approach is based on graphical models that combine audio and video variables. We demonstrate it...