Sciweavers

7495 search results - page 226 / 1499
» Intelligent Document Processing
Sort
View
ICDAR
1997
IEEE
15 years 10 months ago
Representing OCRed documents in HTML
ABSTRACT: OCR is an error-prone process. It is time-consuming and expensive to manually proofread OCR results. The errors remaining in OCRed texts can cause serious problems in rea...
Tao Hong, Sargur N. Srihari
BTW
2007
Springer
127views Database» more  BTW 2007»
16 years 22 days ago
An Adaptive Storage Manager for XML Documents
Abstract. Effective and efficient management and manipulation of XML documents requires stable decisions at the time a document enters the XML DBMS to provide for storage structure...
Karsten Schmidt 0002, Theo Härder
SAMT
2007
Springer
108views Multimedia» more  SAMT 2007»
16 years 20 days ago
Document Layout Substructure Discovery
Abstract. In this paper we present a system, DoLSuD, for the automatic discovery of relevant substructures in a document layout. DoLSuD, Document Layout Substructure Discovery, ext...
Claudio Andreatta
ICDAR
2003
IEEE
15 years 12 months ago
Reference Line Extraction from Form Documents with Complicated Backgrounds
Form document analysis is one of the most essential tasks in document analysis and recognition. One of the most fundamental and crucial tasks is the extraction of the reference li...
Dihua Xi, Seong-Whan Lee
ECIR
2003
Springer
15 years 8 months ago
Hierarchical Classification of HTML Documents with WebClassII
This paper describes a new method for the classification of a HTML document into a hierarchy of categories. The hierarchy of categories is involved in all phases of automated docum...
Michelangelo Ceci, Donato Malerba