We present a two-layer generative model for sport video mining that is composed of a two-layer observation model. The first layer is the Gaussian mixture model (GMM) using framew...
— This paper describes the hardware architecture for a flexible probability density estimation unit to be used in a Large Vocabulary Speech Recognition System, and targeted for m...
The traditional co-training algorithm, which needs a great number of unlabeled examples in advance and then trains classifiers by iterative learning approach, is not suitable for ...
Automatic multimodal recognition of spontaneous emotional expressions is a largely unexplored and challenging problem. In this paper, we explore audio-visual emotion recognition in...
Zhihong Zeng, Yuxiao Hu, Glenn I. Roisman, Zhen We...
Abstract. This paper deals with the CLEAR 2007 evaluation on the detection of acoustic events which happen during seminars or meetings The implemented system consists in a front-en...