Sciweavers

» From Text to Knowledge
ICDAR
2003
IEEE
16 years 4 days ago
Estimating Degradation Model Parameters from Character Images
This paper discusses the use of character images to determine the parameters of an image degradation model. The acute angles in character images provide information used to find ...
Hok Sum Yam, Elisa H. Barney Smith
CIKM
2008
Springer
15 years 8 months ago
CoreEx: Content Extraction from Online News Articles
We developed and tested a heuristic technique for extracting the main article from news site Web pages. We construct the DOM tree of the page and score every node based on the amo...
Jyotika Prasad, Andreas Paepcke
ACL
2003
15 years 8 months ago
Automatic Collection of Related Terms from the Web
This paper proposes a method of collecting a dozen terms that are closely related to a given seed term. The proposed method consists of three steps. The first step, compiling cor...
Satoshi Sato, Yasuhiro Sasaki
ANLP
1994
15 years 8 months ago
Language Determination: Natural Language Processing from Scanned Document Images
Many documents are available to a computer only as images from paper. However, most natural language processing systems expect their input as character-coded text, which may be di...
Penelope Sibun, A. Lawrence Spitz
ACL
2009
15 years 4 months ago
Accurate Learning for Chinese Function Tags from Minimal Features
Data-driven function tag assignment has been studied for English using Penn Treebank data. In this paper, we address the question of whether such a method can be applied to other la...
Caixia Yuan, Fuji Ren, Xiaojie Wang