Sciweavers

2277 search results - page 104 / 456
» Clustering by pattern similarity in large data sets
TKDE
2012
253 views, Formal Methods
13 years 8 months ago
Horizontal Aggregations in SQL to Prepare Data Sets for Data Mining Analysis
Preparing a data set for analysis is generally the most time consuming task in a data mining project, requiring many complex SQL queries, joining tables and aggregating columns....
Carlos Ordonez, Zhibo Chen 0002
EDBT
2004
ACM
192 views, Database
16 years 6 months ago
LIMBO: Scalable Clustering of Categorical Data
Clustering is a problem of great practical importance in numerous applications. The problem of clustering becomes more challenging when the data is categorical, that is, ...
Periklis Andritsos, Panayiotis Tsaparas, René...
FLAIRS
2006
15 years 7 months ago
Mining the Web to Determine Similarity Between Words, Objects, and Communities
The World Wide Web provides a wealth of data that can be harnessed to help improve information retrieval and increase understanding of the relationships between different entities...
Mehran Sahami
CIKM
2009
Springer
15 years 9 months ago
Mining data streams with periodically changing distributions
Dynamic data streams are those whose underlying distribution changes over time. They occur in a number of application domains, and mining them is important for these applications....
Yingying Tao, M. Tamer Özsu
KDD
2010
ACM
233 views, Data Mining
15 years 10 months ago
Evolutionary hierarchical dirichlet processes for multiple correlated time-varying corpora
Mining cluster evolution from multiple correlated time-varying text corpora is important in exploratory text analytics. In this paper, we propose an approach called evolutionary h...
Jianwen Zhang, Yangqiu Song, Changshui Zhang, Shix...