In this paper, we present a multi-label sparse coding framework for feature extraction and classification within the context of automatic image annotation. First, each image is ...
Changhu Wang (University of Science and Technology...
This paper exploits the context of natural dynamic scenes for human action recognition in video. Human actions are frequently constrained by the purpose and the physical propert...
Marcin Marszalek (INRIA), Ivan Laptev (INRIA), Cor...
High-level, or holistic, scene understanding involves reasoning about objects, regions, and the 3D relationships between them. This requires a representation above the level of ...
We describe a learning-based method for recovering 3D human body pose from single images and monocular image sequences. Our approach requires neither an explicit body model nor pr...
In this paper, we propose a probabilistic video-based facial expression recognition method on manifolds. The concept of the manifold of facial expression is based on the observatio...