Sciweavers

» Mining Transformed Data Sets
SDM
2003
SIAM
156 views, Data Mining
15 years 8 months ago
Detection of Underrepresented Biological Sequences using Class-Conditional Distribution Models
A labeled sequence data set related to a certain biological property is often biased and, therefore, does not completely capture its diversity in nature. To reduce this sampling b...
Slobodan Vucetic, Dragoljub Pokrajac, Hongbo Xie, ...
ICDE
2008
IEEE
195 views, Database
16 years 8 months ago
LOCUST: An Online Analytical Processing Framework for High Dimensional Classification of Data Streams
In recent years, data streams have become ubiquitous because of advances in hardware and software technology. The ability to adapt conventional mining problems to data s...
Charu C. Aggarwal, Philip S. Yu
KDD
2005
ACM
166 views, Data Mining
16 years 7 months ago
A general model for clustering binary data
Clustering is the problem of identifying the distribution of patterns and intrinsic correlations in large data sets by partitioning the data points into similarity classes. This p...
Tao Li
ICDM
2009
IEEE
122 views, Data Mining
16 years 1 month ago
Detecting and Interpreting Variable Interactions in Observational Ornithology Data
In this paper we demonstrate a practical approach to interaction detection on real data describing the abundance of different species of birds in the prairies east of the souther...
Daria Sorokina, Rich Caruana, Mirek Riedewald, Wes...
ICDM
2005
IEEE
138 views, Data Mining
16 years 8 days ago
Labeling Unclustered Categorical Data into Clusters Based on the Important Attribute Values
Sampling has been recognized as an important technique to improve the efficiency of clustering. However, with sampling applied, those points which are not sampled will not have t...
Hung-Leng Chen, Kun-Ta Chuang, Ming-Syan Chen