Sciweavers

2988 search results - page 274 / 598
» Business applications of data mining
Sort
View
SIGMOD
2001
ACM
193views Database» more  SIGMOD 2001»
16 years 6 months ago
Epsilon Grid Order: An Algorithm for the Similarity Join on Massive High-Dimensional Data
The similarity join is an important database primitive which has been successfully applied to speed up applications such as similarity search, data analysis and data mining. The s...
Christian Böhm, Bernhard Braunmüller, Fl...
SIGSOFT
2010
ACM
15 years 4 months ago
Software is data too
Software systems are designed and engineered to process data. However, software is data too. The size and variety of today's software artifacts and the multitude of stakehold...
Andrian Marcus, Tim Menzies
KDD
2010
ACM
222views Data Mining» more  KDD 2010»
15 years 8 months ago
Large linear classification when data cannot fit in memory
Recent advances in linear classification have shown that for applications such as document classification, the training can be extremely efficient. However, most of the existing t...
Hsiang-Fu Yu, Cho-Jui Hsieh, Kai-Wei Chang, Chih-J...
SDM
2012
SIAM
452views Data Mining» more  SDM 2012»
13 years 9 months ago
Density-based Projected Clustering over High Dimensional Data Streams
Clustering of high dimensional data streams is an important problem in many application domains, a prominent example being network monitoring. Several approaches have been lately ...
Irene Ntoutsi, Arthur Zimek, Themis Palpanas, Peer...
ICDM
2006
IEEE
129views Data Mining» more  ICDM 2006»
16 years 22 days ago
Consensus Clustering for Detection of Overlapping Clusters in Microarray Data
Most clustering algorithms are partitional in nature, assigning each data point to exactly one cluster. However, several real world datasets have inherently overlapping clusters i...
Meghana Deodhar, Joydeep Ghosh