Sciweavers

16568 search results - page 402 / 3314
» Structured Data on the Web
Sort
View
CIKM
2003
Springer
16 years 1 days ago
Extracting unstructured data from template generated web documents
We propose a novel approach that identifies web page templates and extracts the unstructured data. Extracting only the body of the page and eliminating the template increases the ...
Ling Ma, Nazli Goharian, Abdur Chowdhury, Misun Ch...
DL
1998
Springer
110views Digital Library» more  DL 1998»
15 years 11 months ago
Failure Analysis in Query Construction: Data and Analysis from a Large Sample of Web Queries
This paper reports results from a failure analysis (i.e., incorrect query construction) of 51,473 queries from 18,113 users of Excite, a major Web search engine. Given that many d...
Bernard J. Jansen, Amanda Spink, Tefko Saracevic
ACL
2006
15 years 8 months ago
A DOM Tree Alignment Model for Mining Parallel Data from the Web
This paper presents a new web mining scheme for parallel data acquisition. Based on the Document Object Model (DOM), a web page is represented as a DOM tree. Then a DOM tree align...
Lei Shi, Cheng Niu, Ming Zhou, Jianfeng Gao
IM
2007
15 years 6 months ago
Approximating Personalized PageRank with Minimal Use of Web Graph Data
Abstract. In this paper, we consider the problem of calculating fast and accurate approximations to the personalized PageRank score of a webpage. We focus on techniques to improve ...
David Gleich, Marzia Polito
WWW
2006
ACM
16 years 7 months ago
Upgrading relational legacy data to the semantic web
In this poster, we describe a framework composed of the R2O mapping language and the ODEMapster processor to upgrade relational legacy data to the Semantic Web. The framework is b...
Asunción Gómez-Pérez, Jes&uac...