Abstract. The recognition of vowels in Chinese speech is very important for Chinese speech recognition and understanding. However, it is rather difficult and there has been no effi...
In this paper we present an integrated approach for semantic structure extraction in document images. Document images are initially processed to extract both their layout and logic...
To extend the capabilities that machines can support in a spoken dialogue system, we present in this paper a new problem incorporating multi-session in...
The NITE XML Toolkit (NXT) provides library support for working with multimodal language corpora. We describe work in progress to explore its potential for the AMI project by appl...
This paper presents a novel face detection approach in color images. We employ spatial histograms as robust features for face detection. The spatial histograms consist of marginal ...