Sciweavers

» Data Extraction from Web Data Sources
SIGMOD
2009
ACM
140views Database» more  SIGMOD 2009»
16 years 1 months ago
Robust web extraction: an approach based on a probabilistic tree-edit model
On script-generated web sites, many documents share common HTML tree structure, allowing wrappers to effectively extract information of interest. Of course, the scripts and thus ...
Nilesh N. Dalvi, Philip Bohannon, Fei Sha
KDD
2002
ACM
170views Data Mining» more  KDD 2002»
16 years 7 months ago
Web site mining: a new way to spot competitors, customers and suppliers in the world wide web
When automatically extracting information from the world wide web, most established methods focus on spotting single HTML documents. However, the problem of spotting complete web s...
Martin Ester, Hans-Peter Kriegel, Matthias Schuber...
ICDE
2010
IEEE
272views Database» more  ICDE 2010»
16 years 6 months ago
SMARTINT: A System for Answering Queries over Web Databases Using Attribute Dependencies
Many web databases can be seen as providing partial and overlapping information about entities in the world. To answer queries effectively, we need to integrate the information abo...
Ravi Gummadi, Anupam Khulbe, Aravind Kalavagattu, ...
LREC
2010
180views Education» more  LREC 2010»
15 years 8 months ago
Collecting Voices from the Cloud
The collection and transcription of speech data is typically an expensive and time-consuming task. Voice over IP and cloud computing are poised to greatly reduce this impediment t...
Ian McGraw, Chia-ying Lee, I. Lee Hetherington, St...
ERLANG
2006
ACM
16 years 18 days ago
From HTTP to HTML: Erlang/OTP experiences in web based service applications
This paper describes the lessons learnt when internally developing web applications in Erlang. On the basis of these experiences, a framework called the Web Platform has been impl...
Francesco Cesarini, Lukas Larsson, Michal Slaski