Search Sciweavers | Sciweavers

187

HPDC
2010
IEEE

196views Distributed And Parallel Com...» more HPDC 2010»

Massive Semantic Web data compression with MapReduce

15 years 7 months ago

The Semantic Web consists of many billions of statements made of terms that are either URIs or literals. Since these terms usually consist of long sequences of characters, an effe...

Jacopo Urbani, Jason Maassen, Henri E. Bal

claim paper

Read More »

145

click to vote

INFOCOM
2010
IEEE

158views Communications» more INFOCOM 2010»

15 years 4 months ago

Efficient Similarity Estimation for Systems Exploiting Data Redundancy

Download www.cs.cmu.edu

Many modern systems exploit data redundancy to improve efficiency. These systems split data into chunks, generate identifiers for each of them, and compare the identifiers among ot...

Kanat Tangwongsan, Himabindu Pucha, David G. Ander...

claim paper

Read More »

177

click to vote

IDEAS
2002
IEEE

125views Database» more IDEAS 2002»

Integrating HTML Tables Using Semantic Hierarchies And Meta-Data Sets

15 years 11 months ago

Download students.cs.byu.edu

As the Internet is a global network, there is a demand on accessing closely related data without browsing through di erent Web documents. A signi cant amount of these data are pre...

Seung Jin Lim, Yiu-Kai Ng, Xiaochun Yang

claim paper

Read More »

159

click to vote

SBBD
2004

128views Database» more SBBD 2004»

Integrating Heterogeneous Data Sources in Flexible and Dynamic Environments

15 years 8 months ago

Download www.lbd.dcc.ufmg.br

Flexible and dynamic environments are characterized by high independence from connection participants, low control over available services and high tolerance to communication fail...

Angelo Brayner, Marcelo Meirelles

claim paper

Read More »

185

click to vote

ICDE
2010
IEEE

208views Database» more ICDE 2010»

Duplicate detection in probabilistic data

15 years 6 months ago

Download eprints.eemcs.utwente.nl

Abstract— Collected data often contains uncertainties. Probabilistic databases have been proposed to manage uncertain data. To combine data from multiple autonomous probabilistic...

Fabian Panse, Maurice van Keulen, Ander de Keijzer...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers