Sciweavers

874 search results - page 40 / 175
» Jedi: Extracting and Synthesizing Information from the Web
Sort
View
HT
2009
ACM
15 years 3 months ago
Retrieving broken web links using an approach based on contextual information
In this short note we present a recommendation system for automatic retrieval of broken Web links using an approach based on contextual information. We extract information from th...
Juan Martinez-Romo, Lourdes Araujo
ADC
2006
Springer
130views Database» more  ADC 2006»
16 years 3 days ago
A two-phase rule generation and optimization approach for wrapper generation
Web information extraction is a fundamental issue for web information management and integrations. A common approach is to use wrappers to extract data from web pages or documents...
Yanan Hao, Yanchun Zhang
AAAI
2008
15 years 8 months ago
Automatic Extraction of Data Points and Text Blocks from 2-Dimensional Plots in Digital Documents
Two dimensional plots (2-D) in digital documents on the web are an important source of information that is largely under-utilized. In this paper, we outline how data and text can ...
Saurabh Kataria, William Browuer, Prasenjit Mitra,...
WWW
2004
ACM
16 years 6 months ago
Automatic extraction of web search interfaces for interface schema integration
This paper provides an overview of a technique for extracting information from the Web search interfaces of e-commerce search engines that is useful for supporting automatic searc...
Hai He, Weiyi Meng, Clement T. Yu, Zonghuan Wu
ICDM
2006
IEEE
164views Data Mining» more  ICDM 2006»
16 years 4 days ago
Unsupervised Learning of Tree Alignment Models for Information Extraction
We propose an algorithm for extracting fields from HTML search results. The output of the algorithm is a database table– a data structure that better lends itself to high-level...
Philip Zigoris, Damian Eads, Yi Zhang