Sciweavers

7053 search results - page 394 / 1411
» Data Mining of Multi-categorized Data
Sort
View
KDD
2009
ACM
191views Data Mining» more  KDD 2009»
16 years 7 months ago
BBM: bayesian browsing model from petabyte-scale data
Given a quarter of petabyte click log data, how can we estimate the relevance of each URL for a given query? In this paper, we propose the Bayesian Browsing Model (BBM), a new mod...
Chao Liu 0001, Christos Faloutsos, Fan Guo
KDD
2009
ACM
190views Data Mining» more  KDD 2009»
16 years 1 months ago
Algebraic visual analysis: the Catalano phone call data set case study
While many clever techniques have been proposed for visual analysis, most of these are “one of” and it is not easy to see how to combine multiple techniques. We propose an alg...
Anna A. Shaverdian, Hao Zhou, George Michailidis, ...
IDA
2003
Springer
16 years 2 days ago
Distributed Regression for Heterogeneous Data Sets
Existing meta-learning based distributed data mining approaches do not explicitly address context heterogeneity across individual sites. This limitation constrains their applicatio...
Yan Xing, Michael G. Madden, Jim Duggan, Gerard Ly...
JDWM
2006
84views more  JDWM 2006»
15 years 6 months ago
Discovering Surprising Instances of Simpson's Paradox in Hierarchical Multidimensional Data
This paper focuses on the discovery of surprising, unexpected patterns, based on a data mining method that consists of detecting instances of Simpson's paradox. By its very n...
Carem C. Fabris, Alex Alves Freitas
EUROPAR
1999
Springer
15 years 11 months ago
Parallel k/h-Means Clustering for Large Data Sets
This paper describes the realization of a parallel version of the k/h-means clustering algorithm. This is one of the basic algorithms used in a wide range of data mining tasks. We ...
Kilian Stoffel, Abdelkader Belkoniene