Sciweavers

» Intelligent Document Processing
WIRI
2005
IEEE
16 years 6 days ago
Postal Address Detection from Web Documents
An approach to postal address detection from webpages is proposed. The webpages are first segmented into text blocks based on their visual similarity. The text content in each bl...
Lin Can, Zhang Qian, Xiaofeng Meng, Wenyin Lin
RIAO
2007
15 years 8 months ago
Smart Qualitative Data (SQUAD): Information Extraction in a Large Document Archive
In this paper, we present the results of an investigation into methodologies and technical solutions for exposing the structured metadata contained within digital qualitative data...
Maria Milosavljevic, Claire Grover, Louise Corti
INTERACT
1997
15 years 8 months ago
What Happened to our Document in the Shared Workspace? The Need for Groupware Conventions
Conventions for conducting work with groupware are essential. They include rules for how the groupware functionality should be used for communication about work, for how data shoul...
Gloria Mark, Wolfgang Prinz
WWW
2008
ACM
16 years 7 months ago
As we may perceive: finding the boundaries of compound documents on the web
This paper considers the problem of identifying on the Web compound documents (cDocs) – groups of web pages that in aggregate constitute semantically coherent information entities...
Pavel Dmitriev
CIKM
2004
Springer
16 years 1 days ago
Hierarchical document categorization with support vector machines
Automatically categorizing documents into pre-defined topic hierarchies or taxonomies is a crucial step in knowledge and content management. Standard machine learning techniques ...
Lijuan Cai, Thomas Hofmann