We assess the current state of the art in speech summarization, by comparing a typical summarizer on two different domains: lecture data and the SWITCHBOARD corpus. Our results ca...
Video question answering aims to pinpoint answers in response to user's specified questions. However, most question answering technologies involve in integrating rich specifi...
Fluorescence microscopy is a powerful imaging tool for studying molecular dynamics in living cells. For quantitative motion analysis of subcellular structures robust and accurate ...
We present an automatic and efficient method to extract spatio-temporal human volumes from video, which combines top-down model-based and bottom-up appearancebased approaches. Fr...
A user-centric entity detection system is one in which the primary consumer of the detected entities is a person who can perform actions on the detected entities (e.g. perform a s...