Sciweavers

3245 search results - page 487 / 649
» Mining Transformed Data Sets
Sort
View
ICDM
2010
IEEE
273views Data Mining» more  ICDM 2010»
15 years 4 months ago
Learning Maximum Lag for Grouped Graphical Granger Models
Temporal causal modeling has been a highly active research area in the last few decades. Temporal or time series data arises in a wide array of application domains ranging from med...
Amit Dhurandhar
KDD
2009
ACM
167views Data Mining» more  KDD 2009»
16 years 7 months ago
Seven pitfalls to avoid when running controlled experiments on the web
Controlled experiments, also called randomized experiments and A/B tests, have had a profound influence on multiple fields, including medicine, agriculture, manufacturing, and adv...
Thomas Crook, Brian Frasca, Ron Kohavi, Roger Long...
ICDE
2007
IEEE
165views Database» more  ICDE 2007»
16 years 7 months ago
On Randomization, Public Information and the Curse of Dimensionality
A key method for privacy preserving data mining is that of randomization. Unlike k-anonymity, this technique does not include public information in the underlying assumptions. In ...
Charu C. Aggarwal
STOC
2001
ACM
134views Algorithms» more  STOC 2001»
16 years 6 months ago
Data-streams and histograms
Histograms are typically used to approximate data distributions. Histograms and related synopsis structures have been successful in a wide variety of popular database applications...
Sudipto Guha, Nick Koudas, Kyuseok Shim
DEXA
2007
Springer
154views Database» more  DEXA 2007»
16 years 20 days ago
Performance Oriented Schema Matching
Abstract. Semantic matching of schemas in heterogeneous data sharing systems is time consuming and error prone. Existing mapping tools employ semi-automatic techniques for mapping ...
Khalid Saleem, Zohra Bellahsene, Ela Hunt