Sciweavers

2406 search results - page 370 / 482
» Visualizations: speech, language
Sort
View
LREC
2010
136views Education» more  LREC 2010»
15 years 7 months ago
The CLARIN-NL Project
In this paper I present the CLARIN-NL project, the Dutch national project that aims to play a central role in the European CLARIN infrastructure, not only for the preparatory phas...
Jan Odijk
LREC
2010
189views Education» more  LREC 2010»
15 years 7 months ago
CASIA-CASSIL: a Chinese Telephone Conversation Corpus in Real Scenarios with Multi-leveled Annotation
CASIA-CASSIL is a large-scale corpus base of Chinese human-human naturally-occurring telephone conversations in restricted domains. The first edition consists of 792 90-second con...
Keyan Zhou, Aijun Li, Zhigang Yin, Chengqing Zong
EMNLP
2007
15 years 7 months ago
Extending a Thesaurus in the Pan-Chinese Context
In this paper, we address a unique problem in Chinese language processing and report on our study on extending a Chinese thesaurus with region-specific words, mostly from the fina...
Oi Yee Kwong, Benjamin Ka-Yin T'sou
LREC
2010
159views Education» more  LREC 2010»
15 years 7 months ago
Towards Optimal TTS Corpora
Unit selection text-to-speech systems currently produce very natural synthesized phrases by concatenating speech segments from a large database. Recently, increasing demand for de...
Didier Cadic, Cédric Boidin, Christophe d'A...
EMNLP
2008
15 years 7 months ago
Bayesian Unsupervised Topic Segmentation
This paper describes a novel Bayesian approach to unsupervised topic segmentation. Unsupervised systems for this task are driven by lexical cohesion: the tendency of wellformed se...
Jacob Eisenstein, Regina Barzilay