— One of the most prominent data quality problems is the existence of duplicate records. Current data cleaning systems usually produce one clean instance (repair) of the input da...
George Beskales, Mohamed A. Soliman, Ihab F. Ilyas...
Similarity search in metric spaces has several important applications both in centralized and distributed environments. In centralized applications, such as similarity-based image ...
In the paper we present a methodology for the semiautomated extraction of ontological knowledge from XML data sources in a given domain. We consider an interconnection scenario ov...
Silvana Castano, Valeria De Antonellis, Sabrina De...
This paper presents the design of the storage system for the POSTGRES data base system under construction at Berkeley. It is novel in several ways. First, the storage manager supp...
Continuous SPARQL (C-SPARQL) is a new language for continuous queries over streams of RDF data. CSPARQL queries consider windows, i.e., the most recent triples of such streams, ob...
Davide Francesco Barbieri, Daniele Braga, Stefano ...