Sciweavers

» High Performance Data Mining
IDEAL
2005
Springer
16 years 6 days ago
Probabilistic Data Generation for Deduplication and Data Linkage
Abstract. In many data mining projects the data to be analysed contains personal information, like names and addresses. Cleaning and preprocessing of such data likely involves dedu...
Peter Christen
ICDM
2010
IEEE
15 years 4 months ago
Mining Closed Strict Episodes
Discovering patterns in a sequence is an important aspect of data mining. One popular choice of such patterns is episodes, patterns in sequential data describing events that often...
Nikolaj Tatti, Boris Cule
PAKDD
2004
ACM
16 years 1 days ago
CMTreeMiner: Mining Both Closed and Maximal Frequent Subtrees
Abstract. Tree structures are used extensively in domains such as computational biology, pattern recognition, XML databases, computer networks, and so on. One important problem in ...
Yun Chi, Yirong Yang, Yi Xia, Richard R. Muntz
ICDM
2010
IEEE
15 years 4 months ago
Subgroup Discovery Meets Bayesian Networks -- An Exceptional Model Mining Approach
Whenever a dataset has multiple discrete target variables, we want our algorithms to consider not only the variables themselves, but also the interdependencies between them. We pro...
Wouter Duivesteijn, Arno J. Knobbe, Ad Feelders, M...
DAWAK
2007
Springer
16 years 26 days ago
What-If Analysis for Data Warehouse Evolution
In this paper, we deal with the problem of performing what-if analysis for changes that occur in the schema/structure of the data warehouse. We abstract software modules, queries, r...
George Papastefanatos, Panos Vassiliadis, Alkis Si...