Sciweavers

1083 search results - page 101 / 217
» Efficient Discovery of Confounders in Large Data Sets
Sort
View
DEXA
2005
Springer
124views Database» more  DEXA 2005»
15 years 12 months ago
Scalable Distributed Aggregate Computations Through Collaboration
Computing aggregates over distributed data sets constitutes an interesting class of distributed queries. Recent advances in peer-to-peer discovery of data sources and query process...
Leonidas Galanis, David J. DeWitt
RCIS
2010
15 years 4 months ago
A Tree-based Approach for Efficiently Mining Approximate Frequent Itemsets
—The strategies for mining frequent itemsets, which is the essential part of discovering association rules, have been widely studied over the last decade. In real-world datasets,...
Jia-Ling Koh, Yi-Lang Tu
EDBT
2009
ACM
132views Database» more  EDBT 2009»
16 years 1 months ago
A novel approach for efficient supergraph query processing on graph databases
In recent years, large amount of data modeled by graphs, namely graph data, have been collected in various domains. Efficiently processing queries on graph databases has attracted...
Shuo Zhang, Jianzhong Li, Hong Gao, Zhaonian Zou
KDD
2001
ACM
163views Data Mining» more  KDD 2001»
16 years 6 months ago
The "DGX" distribution for mining massive, skewed data
Skewed distributions appear very often in practice. Unfortunately, the traditional Zipf distribution often fails to model them well. In this paper, we propose a new probability di...
Zhiqiang Bi, Christos Faloutsos, Flip Korn
BMCBI
2004
169views more  BMCBI 2004»
15 years 6 months ago
A power law global error model for the identification of differentially expressed genes in microarray data
Background: High-density oligonucleotide microarray technology enables the discovery of genes that are transcriptionally modulated in different biological samples due to physiolog...
Norman Pavelka, Mattia Pelizzola, Caterina Vizzard...