Sciweavers

4645 search results - page 219 / 929
» Using Information Extraction to Improve Document Retrieval
SIGMOD
2009
ACM
16 years 1 month ago
Robust web extraction: an approach based on a probabilistic tree-edit model
On script-generated web sites, many documents share a common HTML tree structure, allowing wrappers to effectively extract information of interest. Of course, the scripts and thus ...
Nilesh N. Dalvi, Philip Bohannon, Fei Sha
KES
2008
Springer
15 years 6 months ago
Data Mining for Navigation Generating System with Unorganized Web Resources
Users prefer navigating subjects through organized topics drawn from an abundance of resources to listing pages retrieved from search engines. We propose a framework to cluster frequent items...
Diana Purwitasari, Yasuhisa Okazaki, Kenzi Watanab...
CLEF
2005
Springer
16 years 4 days ago
BulQA: Bulgarian-Bulgarian Question Answering at CLEF 2005
This paper describes the architecture of a Bulgarian–Bulgarian question answering system — BulQA. The system relies on a partially parsed corpus for answer extraction. The que...
Kiril Ivanov Simov, Petya Osenova
ACL
2008
15 years 8 months ago
Credibility Improves Topical Blog Post Retrieval
Topical blog post retrieval is the task of ranking blog posts with respect to their relevance for a given topic. To improve topical blog post retrieval we incorporate textual cred...
Wouter Weerkamp, Maarten de Rijke
IJAIT
2007
15 years 6 months ago
Document Retrieval by Projection Based Frequency Distribution
In the document retrieval task, random projection (RP) is a useful technique for dimension reduction. It can be computed very quickly, and recalculation is not necessary for any chang...
Isamu Shioya, Hirohito Oh'uchi, Takao Miura