Sciweavers

» Compacting XML Data
HPDC
2010
IEEE
15 years 7 months ago
Massive Semantic Web data compression with MapReduce
The Semantic Web consists of many billions of statements made of terms that are either URIs or literals. Since these terms usually consist of long sequences of characters, an effe...
Jacopo Urbani, Jason Maassen, Henri E. Bal
INFOCOM
2010
IEEE
15 years 4 months ago
Efficient Similarity Estimation for Systems Exploiting Data Redundancy
Many modern systems exploit data redundancy to improve efficiency. These systems split data into chunks, generate identifiers for each of them, and compare the identifiers among ot...
Kanat Tangwongsan, Himabindu Pucha, David G. Ander...
IDEAS
2002
IEEE
15 years 11 months ago
Integrating HTML Tables Using Semantic Hierarchies And Meta-Data Sets
As the Internet is a global network, there is a demand on accessing closely related data without browsing through different Web documents. A significant amount of these data are pre...
Seung Jin Lim, Yiu-Kai Ng, Xiaochun Yang
SBBD
2004
15 years 8 months ago
Integrating Heterogeneous Data Sources in Flexible and Dynamic Environments
Flexible and dynamic environments are characterized by high independence from connection participants, low control over available services and high tolerance to communication fail...
Angelo Brayner, Marcelo Meirelles
ICDE
2010
IEEE
15 years 6 months ago
Duplicate detection in probabilistic data
Collected data often contains uncertainties. Probabilistic databases have been proposed to manage uncertain data. To combine data from multiple autonomous probabilistic...
Fabian Panse, Maurice van Keulen, Ander de Keijzer...