The multimodal nature of speech is often ignored in human-computer interaction, but lip deformations and other body motions, such as those of the head, convey additional information...
Iain Matthews, Timothy F. Cootes, J. Andrew Bangha...
We propose a fully automatic method for summarizing and indexing unstructured presentation videos based on text extracted from the projected slides. We use changes of text in the ...
Handshape is a key linguistic component of signs, and thus, handshape recognition is essential to algorithms for sign language recognition and retrieval. In this work, linguistic ...
Ashwin Thangali, Stan Sclaroff, Carol Neidle, Joan...
In a current project, customers are attracted by a video streaming application. A video camera records people passing by, and a monitor shows an alienated version of the setting a...
We describe a new expression database, which contains video sequences of both played and natural expressions, and an expression classification system based on warped optical flow fi...
James Skelley, Robert Fischer, Arup Sarma, Bernd H...