Sciweavers

2635 search results - page 219 / 527
» Algorithms for the Database Layout Problem
Sort
View
ICDE
2010
IEEE
204views Database» more  ICDE 2010»
16 years 1 months ago
ProbClean: A probabilistic duplicate detection system
— One of the most prominent data quality problems is the existence of duplicate records. Current data cleaning systems usually produce one clean instance (repair) of the input da...
George Beskales, Mohamed A. Soliman, Ihab F. Ilyas...
ICDE
2005
IEEE
108views Database» more  ICDE 2005»
16 years 5 days ago
Robust Identification of Fuzzy Duplicates
Detecting and eliminating fuzzy duplicates is a critical data cleaning task that is required by many applications. Fuzzy duplicates are multiple seemingly distinct tuples which re...
Surajit Chaudhuri, Venkatesh Ganti, Rajeev Motwani
VLDB
2005
ACM
114views Database» more  VLDB 2005»
16 years 1 days ago
Checking for k-Anonymity Violation by Views
When a private relational table is published using views, secrecy or privacy may be violated. This paper uses a formally-defined notion of k-anonymity to measure disclosure by vi...
Chao Yao, Xiaoyang Sean Wang, Sushil Jajodia
GECCO
2004
Springer
135views Optimization» more  GECCO 2004»
15 years 12 months ago
CellNet Co-Ev: Evolving Better Pattern Recognizers Using Competitive Co-evolution
A model for the co-evolution of patterns and classifiers is presented. The CellNet system for generating binary classifiers is used as a base for experimentation. The CellNet syste...
Taras Kowaliw, Nawwaf N. Kharma, Chris Jensen, Hus...
ADBIS
2000
Springer
88views Database» more  ADBIS 2000»
15 years 11 months ago
Finding Generalized Path Patterns for Web Log Data Mining
Conducting data mining on logs of web servers involves the determination of frequently occurring access sequences. We examine the problem of finding traversal patterns from web lo...
Alexandros Nanopoulos, Yannis Manolopoulos