Sciweavers

3542 search results - page 326 / 709
» Phenomenal Data Mining: From Data to Phenomena
Sort
View
ACSC
2002
IEEE
15 years 11 months ago
Using Finite State Automata for Sequence Mining
We show how frequently occurring sequential patterns may be found from large datasets by first inducing a finite state automaton model describing the data, and then querying the m...
Philip Hingston
SDM
2010
SIAM
166views Data Mining» more  SDM 2010»
15 years 8 months ago
A Permutation Approach to Validation
We give a permutation approach to validation (estimation of out-sample error). One typical use of validation is model selection. We establish the legitimacy of the proposed permut...
Malik Magdon-Ismail, Konstantin Mertsalov
DMIN
2009
142views Data Mining» more  DMIN 2009»
15 years 4 months ago
Efficient Record Linkage using a Double Embedding Scheme
Record linkage is the problem of identifying similar records across different data sources. The similarity between two records is defined based on domain-specific similarity functi...
Noha Adly
ICDM
2005
IEEE
122views Data Mining» more  ICDM 2005»
16 years 10 days ago
Learning through Changes: An Empirical Study of Dynamic Behaviors of Probability Estimation Trees
In practice, learning from data is often hampered by the limited training examples. In this paper, as the size of training data varies, we empirically investigate several probabil...
Kun Zhang, Zujia Xu, Jing Peng, Bill P. Buckles
SDM
2011
SIAM
198views Data Mining» more  SDM 2011»
14 years 9 months ago
Exemplar-based Robust Coherent Biclustering
The biclustering, co-clustering, or subspace clustering problem involves simultaneously grouping the rows and columns of a data matrix to uncover biclusters or sub-matrices of the...
Kewei Tu, Xixiu Ouyang, Dingyi Han, Vasant Honavar