Sciweavers

» The Google Similarity Distance
SIGIR
2002
ACM
15 years 6 months ago
Document clustering with committees
Document clustering is useful in many information retrieval tasks: document browsing, organization and viewing of retrieval results, generation of Yahoo-like hierarchies of docume...
Patrick Pantel, Dekang Lin
CIDM
2011
IEEE
14 years 10 months ago
Partial generalized correlation for hyperspectral data
A variational approach is proposed for the unsupervised assessment of attribute variability of high-dimensional data given a differentiable similarity measure. The key q...
Marc Strickert, Bjorn Labitzke, Volker Blanz
DMIN
2006
15 years 7 months ago
Arabic Text Classification Using N-Gram Frequency Statistics: A Comparative Study
This paper presents the results of classifying Arabic text documents using the N-gram frequency statistics technique employing a dissimilarity measure called the "Manhattan di...
Laila Khreisat
KDD
2004
ACM
16 years 6 months ago
A probabilistic framework for semi-supervised clustering
Unsupervised clustering can be significantly improved using supervision in the form of pairwise constraints, i.e., pairs of instances labeled as belonging to same or different clu...
Sugato Basu, Mikhail Bilenko, Raymond J. Mooney
BIBE
2009
IEEE
16 years 1 month ago
Mining Positional Association Super-Rules on Fixed-Size Protein Sequence Motifs
Protein sequence motif information is crucial to the analysis of biologically significant regions. The conserved regions have the potential to determine the role of the protei...
Bernard Chen, Sinan Kockara