— One of the most prominent data quality problems is the existence of duplicate records. Current data cleaning systems usually produce one clean instance (repair) of the input da...
George Beskales, Mohamed A. Soliman, Ihab F. Ilyas...
— We study the following problem: A data distributor has given sensitive data to a set of supposedly trusted agents (third parties). Some of the data is leaked and found in an un...
Bitmap indexes are widely used in Decision Support Systems (DSSs) to improve query performance. In this paper, we evaluate the use of compressed inverted indexes with adapted quer...
In the light of rapidly growing repositories capturing the movement trajectories of people in spacetime, the need for trajectory compression becomes obvious. This paper argues for ...
ELKI is a unied software framework, designed as a tool suitable for evaluation of dierent algorithms on high dimensional realvalued feature-vectors. A special case of high dimens...
Elke Achtert, Thomas Bernecker, Hans-Peter Kriegel...