Sciweavers

» Technology of Text Mining
SIGIR
2004
ACM
15 years 12 months ago
Parameterized generation of labeled datasets for text categorization based on a hierarchical directory
Although text categorization is a burgeoning area of IR research, readily available test collections in this field are surprisingly scarce. We describe a methodology and system (...
Dmitry Davidov, Evgeniy Gabrilovich, Shaul Markovi...
CIKM
1999
Springer
15 years 10 months ago
A Method of Geographical Name Extraction from Japanese Text for Thematic Geographical Search
A text retrieval method called the thematic geographical search method has been developed and applied to a Japanese encyclopedia called the World Encyclopædia. In this method, th...
Yasusi Kanada
SIGIR
1998
ACM
15 years 10 months ago
Boosting and Rocchio Applied to Text Filtering
We discuss two learning algorithms for text filtering: modified Rocchio and a boosting algorithm called AdaBoost. We show how both algorithms can be adapted to maximize any gene...
Robert E. Schapire, Yoram Singer, Amit Singhal
SPIRE
1998
Springer
15 years 10 months ago
An Experiment Stemming Non-Traditional Text
Stemming is a technique which aims to extract common suffixes of words. Thus, words which are literally different, but have a common stem, may be abstracted by their common stem. The under...
Mario A. Nascimento, Adriano C. R. da Cunha
AIRS
2006
Springer
15 years 10 months ago
Learning to Separate Text Content and Style for Classification
Many text documents naturally have two kinds of labels. For example, we may label web pages from universities according to their categories, such as "student" or "fa...
Dell Zhang, Wee Sun Lee