Sciweavers

6484 search results - page 949 / 1297
» Physical Database Design
KDD
2005
ACM
137 views · Data Mining
16 years 7 months ago
Pattern-based similarity search for microarray data
One fundamental task in near-neighbor search as well as other similarity matching efforts is to find a distance function that can efficiently quantify the similarity between two o...
Haixun Wang, Jian Pei, Philip S. Yu
KDD
2004
ACM
106 views · Data Mining
16 years 7 months ago
Early detection of insider trading in option markets
"Inside information" comes in many forms: knowledge of a corporate takeover, a terrorist attack, unexpectedly poor earnings, the FDA's acceptance of a new drug, etc...
Steve Donoho
KDD
2004
ACM
126 views · Data Mining
16 years 7 months ago
Turning CARTwheels: an alternating algorithm for mining redescriptions
We present an unusual algorithm involving classification trees (CARTwheels) where two trees are grown in opposite directions so that they are joined at their leaves. This approach ...
Naren Ramakrishnan, Deept Kumar, Bud Mishra, Malco...
KDD
2004
ACM
302 views · Data Mining
16 years 7 months ago
Redundancy based feature selection for microarray data
In gene expression microarray data analysis, selecting a small number of discriminative genes from thousands of genes is an important problem for accurate classification of diseas...
Lei Yu, Huan Liu
KDD
2003
ACM
243 views · Data Mining
16 years 7 months ago
Accurate decision trees for mining high-speed data streams
In this paper we study the problem of constructing accurate decision tree models from data streams. Data streams are incremental tasks that require incremental, online, and any-ti...
João Gama, Pedro Medas, Ricardo Rocha