Sciweavers

» Mining Transformed Data Sets
SISAP
2008
IEEE
High-Dimensional Similarity Retrieval Using Dimensional Choice
There are several pieces of information that can be utilized in order to improve the efficiency of similarity searches on high-dimensional data. The most commonly used information...
Dave Tahmoush, Hanan Samet
ICDM
2006
IEEE
Boosting for Learning Multiple Classes with Imbalanced Class Distribution
Classification of data with imbalanced class distribution has posed a significant drawback of the performance attainable by most standard classifier learning algorithms, which ...
Yanmin Sun, Mohamed S. Kamel, Yang Wang 0007
ICDM
2005
IEEE
Efficient Text Classification by Weighted Proximal SVM
In this paper, we present an algorithm that can classify large-scale text data with high classification quality and fast training speed. Our method is based on a novel extension o...
Dong Zhuang, Benyu Zhang, Qiang Yang, Jun Yan, Zhe...
KDD
1998
ACM
Scaling Clustering Algorithms to Large Databases
Practical clustering algorithms require multiple data scans to achieve convergence. For large databases, these scans become prohibitively expensive. We present a scalable clusteri...
Paul S. Bradley, Usama M. Fayyad, Cory Reina
SDM
2004
SIAM
Privacy-Preserving Multivariate Statistical Analysis: Linear Regression and Classification
Multivariate statistical analysis is an important data analysis technique that has found applications in various areas. In this paper, we study some multivariate statistical analy...
Wenliang Du, Yunghsiang S. Han, Shigang Chen