Sciweavers

1550 search results - page 191 / 310
» Evaluating Document Clustering for Interactive Information R...
Sort
View
CIKM
2008
Springer
15 years 8 months ago
Semi-supervised text categorization by active search
In automated text categorization, given a small number of labeled documents, it is very challenging, if not impossible, to build a reliable classifier that is able to achieve high...
Zenglin Xu, Rong Jin, Kaizhu Huang, Michael R. Lyu...
SIGIR
2010
ACM
15 years 6 months ago
Ontology-enriched multi-document summarization in disaster management
In this poster, we propose a novel document summarization approach named Ontology-enriched M ulti-Document Summarization(OMS) for utilizing background knowledge to improve summari...
Lei Li, Dingding Wang, Chao Shen, Tao Li
CLEF
2010
Springer
15 years 6 months ago
Creating a Persian-English Comparable Corpus
Multilingual corpora are valuable resources for cross-language information retrieval and are available in many language pairs. However the Persian language does not have rich multi...
Homa Baradaran Hashemi, Azadeh Shakery, Heshaam Fe...
WWW
2005
ACM
16 years 7 months ago
Three-level caching for efficient query processing in large Web search engines
Large web search engines have to answer thousands of queries per second with interactive response times. Due to the sizes of the data sets involved, often in the range of multiple...
Xiaohui Long, Torsten Suel
SYNASC
2006
IEEE
211views Algorithms» more  SYNASC 2006»
16 years 19 days ago
HTML Pattern Generator--Automatic Data Extraction from Web Pages
Existing methods of information extraction from HTML documents include manual approach, supervised learning and automatic techniques. The manual method has high precision and reca...
Mirel Cosulschi, Adrian Giurca, Bogdan Udrescu, Ni...