Recent work shows how to use local spatio-temporal features to learn models of realistic human actions from video. However, existing methods typically rely on a predefined spatial...
In this paper, we propose a novel Spatiotemporal Interest Point (MC-STIP) detector based on the coherent motion pattern around each voxel in videos. Our detector defines the local...
The quest for a vision system capable of representing and recognizing arbitrary motions benefits from a low dimensional, non-specific representation of flow fields, to be used in ...
Given an input video sequence of one person conducting a sequence of continuous actions, we consider the problem of jointly segmenting and recognizing actions. We propose a discri...
Qinfeng Shi, Li Wang, Li Cheng, Alexander J. Smola