Sciweavers

5046 search results - page 105 / 1010
» Non-redundant data clustering
Sort
View
BIOCOMP
2006
15 years 7 months ago
A Heuristic Approach to Scoring Gene Clustering Algorithms
In the past decades, many clustering algorithms have been proposed for the analysis of gene expression data, but little guidance is available to help choose among them. Given the ...
Longde Yin, Chun-Hsi Huang
EDM
2009
125views Data Mining» more  EDM 2009»
15 years 4 months ago
A Data Mining Approach to Reveal Representative Collaboration Indicators in Open Collaboration Frameworks
Data mining methods are successful in educational environments to discover new knowledge or learner skills or features. Unfortunately, they have not been used in depth with collabo...
Antonio R. Anaya, Jesus Boticario
CCGRID
2010
IEEE
15 years 3 months ago
File-Access Characteristics of Data-Intensive Workflow Applications
This paper studies five real-world data intensive workflow applications in the fields of natural language processing, astronomy image analysis, and web data analysis. Data intensiv...
Takeshi Shibata, SungJun Choi, Kenjiro Taura
KDD
2004
ACM
103views Data Mining» more  KDD 2004»
16 years 6 months ago
An objective evaluation criterion for clustering
We propose and test an objective criterion for evaluation of clustering performance: How well does a clustering algorithm run on unlabeled data aid a classification algorithm? The...
Arindam Banerjee, John Langford
KDD
2001
ACM
152views Data Mining» more  KDD 2001»
16 years 6 months ago
A scalable algorithm for clustering protein sequences
Valerie Guralnik, George Karypis