A convenient representation of a video segment is a single “keyframe.” Keyframes are widely used in applications such as non-linear browsing and video editing. With existing m...
We present a hybrid speaker tracking scheme based on a single pan/tilt/zoom (PTZ) camera in an automated lecture capturing system. Given that the camera’s video resolution is hi...
Cha Zhang, Yong Rui, Li-wei He, Michael N. Wallick
This paper presents Textable Movie, an open-ended interface that allows anyone to become a "video-jockey." In the framework of computational storytelling, Textabl...
Automatically extracting semantics from visual data is a real challenge. In this paper we describe how recent work in cognitive vision leads to significant results in activi...
In this paper we address the problem of unsupervised discovery of action classes in video data. Unlike all methods proposed thus far for this task, we pr...