Sciweavers

1527 search results for "Hidden word statistics" (page 142 of 306)
ACL
2004
15 years 8 months ago
FLSA: Extending Latent Semantic Analysis with Features for Dialogue Act Classification
We discuss Feature Latent Semantic Analysis (FLSA), an extension to Latent Semantic Analysis (LSA). LSA is a statistical method that is ordinarily trained on words only; FLSA adds...
Riccardo Serafin, Barbara Di Eugenio
EACL
2003
ACL Anthology
15 years 8 months ago
Empirical Methods for Compound Splitting
Compounded words are a challenge for NLP applications such as machine translation (MT). We introduce methods to learn splitting rules from monolingual and parallel corpora. We eva...
Philipp Koehn, Kevin Knight
NIPS
2004
15 years 8 months ago
Integrating Topics and Syntax
Statistical approaches to language learning typically focus on either short-range syntactic dependencies or long-range semantic dependencies between words. We present a generative...
Thomas L. Griffiths, Mark Steyvers, David M. Blei,...
ACL
1997
15 years 8 months ago
Document Classification Using a Finite Mixture Model
We propose a new method of classifying documents into categories. We define for each category a finite mixture model based on soft clustering of words. We treat the problem of cla...
Hang Li, Kenji Yamanishi
IDA
2007
Springer
15 years 6 months ago
Voting experts: An unsupervised algorithm for segmenting sequences
We describe a statistical signature of chunks and an algorithm for finding chunks. While there is no formal definition of chunks, they may be reliably identified as configurat...
Paul R. Cohen, Niall M. Adams, Brent Heeringa