Sciweavers

7495 search results - page 294 / 1499
» Intelligent Document Processing
Sort
View
IJCNLP
2004
Springer
16 years 23 hour ago
Combining Labeled and Unlabeled Data for Learning Cross-Document Structural Relationships
Multi-document discourse analysis has emerged with the potential of improving various NLP applications. Based on the newly proposed Cross-document Structure Theory (CST), this pap...
Zhu Zhang, Dragomir R. Radev
DOCENG
2003
ACM
15 years 12 months ago
Improving formatting documents by coupling formatting systems
In this paper, we present a framework for coupling an existing formatting system such as SMIL [7] and Madeus [13] with a formatting control system XEF [10]. This framework allows ...
Fateh Boulmaiz, Cécile Roisin, Fréd&...
ICPR
2000
IEEE
15 years 11 months ago
Automatic Ground-Truth Generation for Skew-Tolerance Evaluation of Document Layout Analysis Methods
Generation of ground-truths is of great importance for unbiased performance evaluation of document layout analysis methods. This is especially necessary because many methods are c...
Oleg Okun, Matti Pietikäinen
EDBTW
2006
Springer
15 years 8 months ago
Efficient Integrity Checking over XML Documents
The need for incremental constraint maintenance within collections of semi-structured documents has been ever increasing in the last years due to the widespread diffusion of XML. T...
Daniele Braga, Alessandro Campi, Davide Martinengh...
IJCAI
2001
15 years 8 months ago
Combining Statistics and Semantics for Word and Document Clustering
A new approach for constructing pseudo-keywords, referred to as Sense Units, is proposed. Sense Units are obtained by a word clustering process, where the underlying similarity re...
Alexandre Termier, Michèle Sebag, Marie-Chr...