Sciweavers

2662 search results - page 299 / 533
» Data Compression Support in Databases
ICDE
2010
IEEE
16 years 1 months ago
ProbClean: A probabilistic duplicate detection system
One of the most prominent data quality problems is the existence of duplicate records. Current data cleaning systems usually produce one clean instance (repair) of the input da...
George Beskales, Mohamed A. Soliman, Ihab F. Ilyas...
DASFAA
2010
IEEE
15 years 11 months ago
Peer-to-Peer Similarity Search Based on M-Tree Indexing
Similarity search in metric spaces has several important applications both in centralized and distributed environments. In centralized applications, such as similarity-based image ...
Akrivi Vlachou, Christos Doulkeridis, Yannis Kotid...
DEXAW
2002
IEEE
15 years 11 months ago
Semi-Automated Extraction of Ontological Knowledge from XML Datasources
In the paper we present a methodology for the semi-automated extraction of ontological knowledge from XML data sources in a given domain. We consider an interconnection scenario ov...
Silvana Castano, Valeria De Antonellis, Sabrina De...
VLDB
1987
ACM
15 years 10 months ago
The Design of the POSTGRES Storage System
This paper presents the design of the storage system for the POSTGRES data base system under construction at Berkeley. It is novel in several ways. First, the storage manager supp...
Michael Stonebraker
SIGMOD
2010
ACM
15 years 5 months ago
Querying RDF streams with C-SPARQL
Continuous SPARQL (C-SPARQL) is a new language for continuous queries over streams of RDF data. C-SPARQL queries consider windows, i.e., the most recent triples of such streams, ob...
Davide Francesco Barbieri, Daniele Braga, Stefano ...