Sciweavers

2277 search results - page 148 / 456
» Clustering by pattern similarity in large data sets
SDM
2004
SIAM
15 years 7 months ago
Subspace Clustering of High Dimensional Data
Clustering suffers from the curse of dimensionality, and similarity functions that use all input features with equal relevance may not be effective. We introduce an algorithm that...
Carlotta Domeniconi, Dimitris Papadopoulos, Dimitr...
PAKDD
2009
ACM
16 years 1 month ago
Aggregated Subset Mining
The usual data mining setting uses the full amount of data to derive patterns for different purposes. Taking cues from machine learning techniques, we explore ways to divide the d...
Albrecht Zimmermann, Björn Bringmann
ACST
2006
15 years 8 months ago
Distributed hierarchical document clustering
This paper investigates the applicability of a distributed clustering technique called RACHET [1] to organizing large sets of distributed text data. Although the authors of RACHET c...
Debzani Deb, M. Muztaba Fuad, Rafal A. Angryk
IJAR
2007
15 years 6 months ago
Fuzzy clustering in parallel universes
We present an extension of the fuzzy c-Means algorithm, which operates simultaneously on different feature spaces—so-called parallel universes—and also incorporates noise det...
Bernd Wiswedel, Michael R. Berthold
ICML
2010
IEEE
15 years 7 months ago
Power Iteration Clustering
We present a simple and scalable graph clustering method called power iteration clustering (PIC). PIC finds a very low-dimensional embedding of a dataset using truncated power ite...
Frank Lin, William W. Cohen