Sciweavers

7495 search results - page 297 / 1499
» Intelligent Document Processing
IR
2007
15 years 6 months ago
Regularizing query-based retrieval scores
In information retrieval, the cluster hypothesis states: closely related documents tend to be relevant to the same request. We exploit this hypothesis directly by adjusting queryb...
Fernando Diaz
CATA
2004
15 years 8 months ago
Annotating Linguistic Data with ImageSpace for the Preservation of Endangered Languages
Many languages are in serious danger of being lost and, as a result, there has been a significant increase in language documentation projects, and also in attempts to preserve lang...
Shiyong Lu, Rong Huang, Farshad Fotouhi
TREC
2003
15 years 8 months ago
Active Feedback - UIUC TREC-2003 HARD Experiments
In this paper, we report our experiments on the HARD (High Accuracy Retrieval from Documents) Track in TREC 2003. We focus on active feedback, i.e., how to intelligently propose q...
Xuehua Shen, ChengXiang Zhai
CIKM
2003
Springer
15 years 12 months ago
Extracting unstructured data from template generated web documents
We propose a novel approach that identifies web page templates and extracts the unstructured data. Extracting only the body of the page and eliminating the template increases the ...
Ling Ma, Nazli Goharian, Abdur Chowdhury, Misun Ch...
ICIP
2000
IEEE
15 years 11 months ago
Hough Technique for Bar Charts Detection and Recognition in Document Images
Charts are a common graphic representation for scientific data in technical and business papers. We present a robust system for detecting and recognizing bar charts. The system incl...
Yan Ping Zhou, Chew Lim Tan