Sciweavers

11538 search results - page 254 / 2308
» From Text to Knowledge
Sort
View
EACL
2006
ACL Anthology
15 years 8 months ago
Multilingual Term Extraction from Domain-specific Corpora Using Morphological Structure
Morphologically complex terms composed from Greek or Latin elements are frequent in scientific and technical texts. Word forming units are thus relevant cues for the identificatio...
Delphine Bernhard
IJCAI
2003
15 years 8 months ago
Information Extraction from Tree Documents by Learning Subtree Delimiters
Information extraction from HTML pages has been conventionally treated as plain text documents extended with HTML tags. However, the growing maturity and correct usage of HTML/XHT...
Boris Chidlovskii
ICDE
2006
IEEE
124views Database» more  ICDE 2006»
16 years 8 months ago
Segmentation of Publication Records of Authors from the Web
Publication records are often found in the authors' personal home pages. If such a record is partitioned into a list of semantic fields of authors, title, date, etc., the uns...
Wei Zhang, Clement T. Yu, Neil R. Smalheiser, Vetl...
PROPOR
2010
Springer
278views Languages» more  PROPOR 2010»
16 years 1 months ago
Translating from Complex to Simplified Sentences
We address the problem of simplifying Portuguese texts at the sentence level treating it as a "translation task". We use the Statistical Machine Translation (SMT) framewo...
Lucia Specia
HICSS
2005
IEEE
150views Biometrics» more  HICSS 2005»
16 years 6 days ago
What are the Characteristics of Digital Genres? - Genre Theory from a Multi-Modal Perspective
This paper explores the possibility of extending the functional genre analysis model to account for the genre characteristics of non-linear, multi-modal, webmediated documents. Th...
Inger Askehave, Anne Ellerup Nielsen