Sciweavers

11538 search results - page 433 / 2308
» From Text to Knowledge
Sort
View
DAS
2006
Springer
15 years 10 months ago
Segmentation-Driven Recognition Applied to Numerical Field Extraction from Handwritten Incoming Mail Documents
Abstract. In this paper, we present a method for the automatic extraction of numerical fields (zip codes, phone numbers, etc.) from incoming mail documents. The approach is based o...
Clément Chatelain, Laurent Heutte, Thierry ...
ECIR
2006
Springer
15 years 8 months ago
Automatic Acquisition of Chinese-English Parallel Corpus from the Web
Parallel corpora are a valuable resource for tasks such as cross-language information retrieval and data-driven natural language processing systems. Previously only small scale cor...
Ying Zhang, Ke Wu, Jianfeng Gao, Phil Vines
ACL
2003
15 years 8 months ago
Automatic Acquisition of Named Entity Tagged Corpus from World Wide Web
In this paper, we present a method that automatically constructs a Named Entity (NE) tagged corpus from the web to be used for learning of Named Entity Recognition systems. We use...
Joohui An, Seungwoo Lee, Gary Geunbae Lee
CORIA
2009
15 years 8 months ago
Aggregated search: From information nuggets to aggregated documents
The aggregated search assembles in one interface information from different sources. It deals with different types of content (text, video, image, etc) and granularities of retriev...
Arlind Kopliku
JMM2
2007
100views more  JMM2 2007»
15 years 6 months ago
On Separation of English Numerals from Multilingual Document Images
— For Optical Character Recognition (OCR) of bilingual or multilingual document containing text words in regional language and numerals in English, it is necessary to identify di...
Basanna V. Dhandra, Mallikarjun Hangarge