. Traditional camera pedestals are manually operated. Our long term goal is to construct a fully autonomous pedestal system which can respond to changes in a scene and mimicking t...
Richard Yi Da Xu, Joshua M. Brown, Jason M. Traish...
Audio segmentation has applications in a variety of contexts, such as audio information retrieval, automatic sound analysis, and as a pre-processing step in speech recognition. Ex...
Tara N. Sainath, Dimitri Kanevsky, Giridharan Iyen...
We present a two-layer generative model for sport video mining that is composed of a two-layer observation model. The first layer is the Gaussian mixture model (GMM) using framew...
— The focus of this paper is mental tension detection in speech to assist control the tension in day-to-day business such as conferences and operations in a call center. It is di...
In this paper, we present a new solution to the problem of multi-camera tracking with non-overlapping fields of view. The identities of moving objects are maintained when they are...