Sciweavers

583 search results - page 46 / 117
» Automatic extraction of titles from general documents using ...
Sort
View
PAKDD
2001
ACM
157views Data Mining» more  PAKDD 2001»
15 years 10 months ago
Applying Pattern Mining to Web Information Extraction
Information extraction (IE) from semi-structured Web documents is a critical issue for information integration systems on the Internet. Previous work in wrapper induction aim to so...
Chia-Hui Chang, Shao-Chen Lui, Yen-Chin Wu
ANLP
1992
116views more  ANLP 1992»
15 years 7 months ago
Automatic Learning for Semantic Collocation
The real di culty in development of practical NLP systems comes from the fact that we do not have e ective means for gathering \knowledge". In this paper, we propose an algor...
Satoshi Sekine, Jeremy J. Carroll, Sophia Ananiado...
LREC
2008
135views Education» more  LREC 2008»
15 years 7 months ago
Communicating Unknown Words in Machine Translation
A new approach to handle unknown words in machine translation is presented. The basic idea is to find definitions for the unknown words on the source language side and translate t...
Matthias Eck, Stephan Vogel, Alex Waibel
ISI
2006
Springer
15 years 6 months ago
Analyzing Entities and Topics in News Articles Using Statistical Topic Models
Statistical language models can learn relationships between topics discussed in a document collection and persons, organizations and places mentioned in each document. We present a...
David Newman, Chaitanya Chemudugunta, Padhraic Smy...
ICAIL
2009
ACM
15 years 10 months ago
Segmentation of legal documents
An overwhelming number of legal documents is available in digital form. However, most of the texts are usually only provided in a semi-structured form, i.e. the documents are stru...
Eneldo Loza Mencía