Sciweavers

ACL
2007
15 years 8 months ago
Sparse Information Extraction: Unsupervised Language Models to the Rescue
Even in a massive corpus such as the Web, a substantial fraction of extractions appear infrequently. This paper shows how to assess the correctness of sparse extractions by utiliz...
Doug Downey, Stefan Schoenmackers, Oren Etzioni
DEXAW
2010
IEEE
15 years 7 months ago
Using Progressive Filtering to Deal with Information Overload
In the age of Web 2.0, people organize large collections of web pages, articles, or emails in hierarchies of topics, or arrange a large body of knowledge in ontologies. T...
Andrea Addis, Giuliano Armano, Eloisa Vargiu
ICEIS
2009
IEEE
16 years 1 month ago
Semi-supervised Information Extraction from Variable-length Web-page Lists
We propose two methods for constructing automated programs for extraction of information from a class of web pages that are very common and of high practical significance - varia...
Daniel Nikovski, Alan Esenther, Akihiro Baba
IJCAI
2003
15 years 8 months ago
Statistics Gathering for Learning from Distributed, Heterogeneous and Autonomous Data Sources
With the growing use of distributed information networks, there is an increasing need for algorithmic and system solutions for data-driven knowledge acquisition using distributed,...
Doina Caragea, Jaime Reinoso, Adrian Silvescu, Vas...
KDD
2009
ACM
15 years 11 months ago
PSkip: estimating relevance ranking quality from web search clickthrough data
In this article, we report our efforts in mining the information encoded as clickthrough data in the server logs to evaluate and monitor the relevance ranking quality of a commer...
Kuansan Wang, Toby Walker, Zijian Zheng