Sciweavers

2929 search results - page 226 / 586
» Models of English Text
Sort
View
ICDE
2006
IEEE
428views Database» more  ICDE 2006»
16 years 7 months ago
Integrating Unstructured Data into Relational Databases
In this paper we present a system for automatically integrating unstructured text into a multi-relational database using state-of-the-art statistical models for structure extracti...
Imran R. Mansuri, Sunita Sarawagi
ENC
2005
IEEE
16 years 4 days ago
Combining Structural and Textual Contexts for Compressing Semistructured Databases
We describe a compression technique for semistructured documents, called SCMPPM, which combines the Prediction by Partial Matching technique with Structural Contexts Model (SCM) t...
Joaquín Adiego, Pablo de la Fuente, Gonzalo...
CLEF
2008
Springer
15 years 8 months ago
Back to Basics - Again - for Domain-Specific Retrieval
In this paper we will describe Berkeley's approach to the Domain Specific (DS) track for CLEF 2008. Last year we used Entry Vocabulary Indexes and Thesaurus expansion approac...
Ray R. Larson
EMNLP
2009
15 years 4 months ago
Cross-lingual Semantic Relatedness Using Encyclopedic Knowledge
In this paper, we address the task of crosslingual semantic relatedness. We introduce a method that relies on the information extracted from Wikipedia, by exploiting the interlang...
Samer Hassan, Rada Mihalcea
KDD
2009
ACM
156views Data Mining» more  KDD 2009»
16 years 7 months ago
Effective multi-label active learning for text classification
Labeling text data is quite time-consuming but essential for automatic text classification. Especially, manually creating multiple labels for each document may become impractical ...
Bishan Yang, Jian-Tao Sun, Tengjiao Wang, Zheng Ch...