Sciweavers

2827 search results - page 343 / 566
» Marking Text Documents
Sort
View
ICIP
2009
IEEE
16 years 7 months ago
Recognition Driven Page Orientation Detection
In document image recognition, orientation detection of the scanned page is necessary for the following procedures to work correctly as they assume that the text is well oriented....
WWW
2002
ACM
16 years 7 months ago
Authoring and annotation of web pages in CREAM
Richly interlinked, machine-understandable data constitute the basis for the Semantic Web. We provide a framework, CREAM, that allows for creation of metadata. While the annotatio...
Siegfried Handschuh, Steffen Staab
EDBT
2006
ACM
181views Database» more  EDBT 2006»
16 years 6 months ago
TeNDaX, a Collaborative Database-Based Real-Time Editor System
TeNDaX is a collaborative database-based real-time editor system. TeNDaX is a new approach for word-processing in which documents (i.e. content and structure, tables, images etc.) ...
Klaus R. Dittrich, Michael H. Böhlen, Stefani...
ICDE
2009
IEEE
155views Database» more  ICDE 2009»
16 years 1 months ago
Join Optimization of Information Extraction Output: Quality Matters!
— Information extraction (IE) systems are trained to extract specific relations from text databases. Real-world applications often require that the output of multiple IE systems...
Alpa Jain, Panagiotis G. Ipeirotis, AnHai Doan, Lu...
SEMCO
2007
IEEE
16 years 27 days ago
Intelligent Parsing of Scanned Volumes for Web Based Archives
The proliferation of digital libraries and the large amount of existing documents raise important issues in efficient handling of documents. Printed texts in documents need to be...
Xiaonan Lu, James Ze Wang, C. Lee Giles