Sciweavers

» Using Information Extraction to Improve Document Retrieval
KDD
2008
ACM
16 years 7 months ago
Unsupervised deduplication using cross-field dependencies
Recent work in deduplication has shown that collective deduplication of different attribute types can improve performance. But although these techniques cluster the attributes col...
Robert Hall, Charles A. Sutton, Andrew McCallum
CLEF
2009
Springer
15 years 7 months ago
Approaching Question Answering by Means of Paragraph Validation
In this paper we describe the system we developed for taking part in monolingual Spanish and English tasks at ResPubliQA 2009. Our system was composed by an IR phase focused on im...
Álvaro Rodrigo, Joaquín Pérez...
COLING
2010
15 years 1 months ago
An Empirical Study on Web Mining of Parallel Data
This paper presents an empirical approach to mining parallel corpora. Conventional approaches use a readily available collection of comparable, nonparallel corpora to extract par...
Gum-Won Hong, Chi-Ho Li, Ming Zhou, Hae-Chang Rim
SIGIR
2000
ACM
15 years 11 months ago
OCELOT: a system for summarizing Web pages
We introduce OCELOT, a prototype system for automatically generating the “gist” of a web page by summarizing it. Although most text summarization research to date has ...
Adam L. Berger, Vibhu O. Mittal
COLING
2010
15 years 1 months ago
Efficient Statement Identification for Automatic Market Forecasting
Strategic business decision making involves the analysis of market forecasts. Today, the identification and aggregation of relevant market statements is done by human experts, oft...
Henning Wachsmuth, Peter Prettenhofer, Benno Stein