Sciweavers

14285 search results - page 2444 / 2857
» Web Based Management
KDD 2004 (ACM), Data Mining
Systematic data selection to mine concept-drifting data streams
One major problem with existing methods for mining data streams is that they make ad hoc choices to combine the most recent data with some amount of old data when searching for the new hypothesis. T...
Wei Fan
KDD 2004 (ACM), Data Mining
Improved robustness of signature-based near-replica detection via lexicon randomization
Detection of near duplicate documents is an important problem in many data mining and information filtering applications. When faced with massive quantities of data, traditional d...
Aleksander Kolcz, Abdur Chowdhury, Joshua Alspecto...
KDD 2004 (ACM), Data Mining
Generalizing the notion of support
The goal of this paper is to show that generalizing the notion of support can be useful in extending association analysis to non-traditional types of patterns and non-binary data....
Michael Steinbach, Pang-Ning Tan, Hui Xiong, Vipin...
KDD 2004 (ACM), Data Mining
Ordering patterns by combining opinions from multiple sources
Pattern ordering is an important task in data mining because the number of patterns extracted by standard data mining algorithms often exceeds our capacity to manually analyze the...
Pang-Ning Tan, Rong Jin
KDD 2003 (ACM), Data Mining
Adaptive duplicate detection using learnable string similarity measures
Identifying approximately duplicate records in databases is an essential step in data cleaning and data integration processes. Most existing approaches have relied...
Mikhail Bilenko, Raymond J. Mooney