Sciweavers

2827 search results - page 260 / 566
» Marking Text Documents
Sort
View
ICDAR
2003
IEEE
15 years 12 months ago
A Model-based Line Detection Algorithm in Documents
In this paper we present a novel model based approach to detect severely broken parallel lines in noisy textual documents. It is important to detect and remove these lines so the ...
Yefeng Zheng, Huiping Li, David S. Doermann
WIDM
2003
ACM
15 years 11 months ago
Clustering documents in a web directory
Hierarchical categorization of documents is a task receiving growing interest due to the widespread proliferation of topic hierarchies for text documents. The worst problem of hie...
Giordano Adami, Paolo Avesani, Diego Sona
WWW
2003
ACM
15 years 11 months ago
The V2 Temporal Document Database System
Support for temporal text-containment queries (query for all versions of documents that contained one or more particular words at a particular time t) is of interest in a number of...
Kjetil Nørvåg
ADC
2003
Springer
115views Database» more  ADC 2003»
15 years 10 months ago
Document Classification via Structure Synopses
Information available in the Internet is frequently supplied simply as plain ascii text, structured according to orthographic and semantic conventions. Traditional document classi...
Liping Ma, John Shepherd, Anh Nguyen
DIAL
2006
IEEE
146views Image Analysis» more  DIAL 2006»
15 years 8 months ago
Distance Measures for Layout-Based Document Image Retrieval
Most methods for document image retrieval rely solely on text information to find similar documents. This paper describes a way to use layout information for document image retrie...
Joost van Beusekom, Daniel Keysers, Faisal Shafait...