Sciweavers

5758 search results - page 240 / 1152
» Anonymity-preserving data collection
Sort
View
LREC
2008
110views Education» more  LREC 2008»
15 years 8 months ago
New Telephone Speech Databases for French: a Children Database and an optimized Adult Corpus
This paper presents the results of the NEOLOGOS project: a children database and an optimized adult database for the French language. A new approach was adopted for the collection...
Djamel Mostefa, Arnaud Vallee
DAGSTUHL
2006
15 years 8 months ago
Information Access to Historical Documents from the Early New High German Period
With the new interest in historical documents insight grew that electronic access to these texts causes many specific problems. In the first part of the paper we survey the presen...
Andreas Hauser, Markus Heller, Elisabeth Leiss, Kl...
EACL
2006
ACL Anthology
15 years 8 months ago
Web Text Corpus for Natural Language Processing
Web text has been successfully used as training data for many NLP applications. While most previous work accesses web text through search engine hit counts, we created a Web Corpu...
Vinci Liu, James R. Curran
ACL
2001
15 years 8 months ago
Quantitative and Qualitative Evaluation of Darpa Communicator Spoken Dialogue Systems
This paper describes the application of the PARADISE evaluation framework to the corpus of 662 human-computer dialogues collected in the June 2000 Darpa Communicator data collecti...
Marilyn A. Walker, Rebecca J. Passonneau, Julie E....
NIPS
2001
15 years 8 months ago
Latent Dirichlet Allocation
We describe latent Dirichlet allocation (LDA), a generative probabilistic model for collections of discrete data such as text corpora. LDA is a three-level hierarchical Bayesian m...
David M. Blei, Andrew Y. Ng, Michael I. Jordan