Search Sciweavers | Sciweavers

7495 search results - page 337 / 1499

» Intelligent Document Processing

198

click to vote

ITCC
2003
IEEE

96views Information Technology» more ITCC 2003»

A Method for Calculating Term Similarity on Large Document Collections

15 years 12 months ago

Download www.isri.unlv.edu

We present an efﬁcient algorithm called the Quadtree Heuristic for identifying a list of similar terms for each unique term in a large document collection. Term similarity is de...

Wolfgang W. Bein, Jeffrey S. Coombs, Kazem Taghva

claim paper

Read More »

160

click to vote

DOCENG
2003
ACM

160views Document Analysis» more DOCENG 2003»

Creating reusable well-structured PDF as a sequence of component object graphic (COG) elements

15 years 12 months ago

Download eprints.nottingham.ac.uk

Portable Document Format (PDF) is a page-oriented, graphically rich format based on PostScript semantics and it is also the format interpreted by the Adobe Acrobat viewers. Althou...

Steven R. Bagley, David F. Brailsford, Matthew R. ...

claim paper

Read More »

147

click to vote

ICDAR
2003
IEEE

133views Document Analysis» more ICDAR 2003»

A Character Recognizer for Turkish Language

16 years 7 hour ago

Download www.cse.salford.ac.uk

This paper presents particularly a contextual post processing subsystem for a Turkish machine printed character recognition system. The contextual post processing subsystem is bas...

Sait Ulas Korkmaz, G. Kirçiçegi, Y. ...

claim paper

Read More »

208

click to vote

KDD
2008
ACM

183views Data Mining» more KDD 2008»

Structured entity identification and document categorization: two tasks with one joint model

16 years 7 months ago

Download www.godbole.net

Traditionally, research in identifying structured entities in documents has proceeded independently of document categorization research. In this paper, we observe that these two t...

Indrajit Bhattacharya, Shantanu Godbole, Sachindra...

claim paper

Read More »

187

Voted

KDD
2007
ACM

186views Data Mining» more KDD 2007»

Content-based document routing and index partitioning for scalable similarity-based searches in a large corpus

16 years 7 months ago

Download www.ssrc.ucsc.edu

We present a document routing and index partitioning scheme for scalable similarity-based search of documents in a large corpus. We consider the case when similarity-based search ...

Deepavali Bhagwat, Kave Eshghi, Pankaj Mehra

claim paper

Read More »

« Prev « First page 337 / 1499 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers