Sciweavers

8316 search results - page 369 / 1664
» Web Document Modeling
Sort
View
ICEBE
2006
IEEE
140views Business» more  ICEBE 2006»
16 years 24 days ago
Achieving Transparent Integration of Information, Documents and Processes
Business interoperation is important especially in electronic business. It requires the integration of business information, business documents and business processes. Nevertheles...
Jingzhi Guo
SIGIR
2000
ACM
15 years 11 months ago
An investigation of linguistic features and clustering algorithms for topical document clustering
We investigate four hierarchical clustering methods (single-link, complete-link, groupwise-average, and single-pass) and two linguistically motivated text features (noun phrase he...
Vasileios Hatzivassiloglou, Luis Gravano, Ankineed...
SIGIR
2011
ACM
14 years 9 months ago
When documents are very long, BM25 fails!
We reveal that the Okapi BM25 retrieval function tends to overly penalize very long documents. To address this problem, we present a simple yet effective extension of BM25, namel...
Yuanhua Lv, ChengXiang Zhai
EMNLP
2010
15 years 4 months ago
Collective Cross-Document Relation Extraction Without Labelled Data
We present a novel approach to relation extraction that integrates information across documents, performs global inference and requires no labelled text. In particular, we tackle ...
Limin Yao, Sebastian Riedel, Andrew McCallum
ICIP
2008
IEEE
16 years 1 months ago
Iterative pre- and post-processing for MRC layers of scanned documents
The Mixed Raster Content (MRC) document compression standard (ITU T.44) specifies a multi-layer multi-resolution representation of a compound document. The model is very efficie...
Alexandre Zaghetto, Ricardo L. de Queiroz