Sciweavers

11538 search results - page 354 / 2308
» From Text to Knowledge
Sort
View
SIGIR
1995
ACM
15 years 10 months ago
Noise Reduction in a Statistical Approach to Text Categorization
This paper studies noise reduction for computational efficiency improvements in a statistical learning method for text categorization, the Linear Least Squares Fit (LLSF) mapping...
Yiming Yang
FLAIRS
2007
15 years 9 months ago
Cohesion and Structural Organization in High School Texts
Recent research in reading comprehension supports the hypothesis that readers are aided by textual cohesion. Traditional readability formulas are not able to effectively assess le...
Erin J. Lightman, Philip M. McCarthy, David F. Duf...
ESANN
2007
15 years 8 months ago
Kernel PCA based clustering for inducing features in text categorization
We study dimensionality reduction or feature selection in text document categorization problem. We focus on the first step in building text categorization systems, that is the cho...
Zsolt Minier, Lehel Csató
LREC
2008
88views Education» more  LREC 2008»
15 years 8 months ago
A Trainable Tokenizer, solution for multilingual texts and compound expression tokenization
Tokenization is one of the initial steps done for almost any text processing task. It is not particularly recognized as a challenging task for English monolingual systems but it r...
Oana Frunza
EMNLP
2004
15 years 8 months ago
A Boosting Algorithm for Classification of Semi-Structured Text
The focus of research in text classification has expanded from simple topic identification to more challenging tasks such as opinion/modality identification. Unfortunately, the la...
Taku Kudo, Yuji Matsumoto