Sciweavers

1486 search results - page 72 / 298
» A Document as a Small World
Sort
View
ICDAR
2003
IEEE
15 years 11 months ago
Automatic Feature Selection with Applications to Script Identification of Degraded Documents
Current approaches to script identification rely on hand-selected features and often require processing a significant part of the document to achieve reliable identification. We p...
Vitaly Ablavsky, Mark R. Stevens
ICDM
2003
IEEE
138views Data Mining» more  ICDM 2003»
15 years 11 months ago
Ontologies Improve Text Document Clustering
Text document clustering plays an important role in providing intuitive navigation and browsing mechanisms by organizing large sets of documents into a small number of meaningful ...
Andreas Hotho, Steffen Staab, Gerd Stumme
RIAO
2007
15 years 7 months ago
Document frequency and term specificity
Document frequency is used in various applications in Information Retrieval and other related fields. An assumption frequently made is that the document frequency represents a lev...
Hideo Joho, Mark Sanderson
ANLP
1994
104views more  ANLP 1994»
15 years 7 months ago
Language Determination: Natural Language Processing from Scanned Document Images
Many documents are available to a computer only as images from paper. However, most natural language processing systems expect their input as character-coded text, which may be di...
Penelope Sibun, A. Lawrence Spitz
SDM
2009
SIAM
108views Data Mining» more  SDM 2009»
16 years 3 months ago
Highlighting Diverse Concepts in Documents.
We show the underpinnings of a method for summarizing documents: it ingests a document and automatically highlights a small set of sentences that are expected to cover the differ...
Evimaria Terzi, Kun Liu, Tyrone Grandison