Search Sciweavers | Sciweavers




KDD
2004
ACM

117 views, Data Mining

Systematic data selection to mine concept-drifting data streams

16 years 7 months ago

Download www.weifan.info

One major problem with existing methods for mining data streams is that they make ad hoc choices to combine the most recent data with some amount of old data to search for the new hypothesis. T...

Wei Fan


KDD
2004
ACM

195 views, Data Mining

Improved robustness of signature-based near-replica detection via lexicon randomization

16 years 7 months ago

Download ir.iit.edu

Detection of near duplicate documents is an important problem in many data mining and information filtering applications. When faced with massive quantities of data, traditional d...

Aleksander Kolcz, Abdur Chowdhury, Joshua Alspecto...


KDD
2004
ACM

110 views, Data Mining

Generalizing the notion of support

16 years 7 months ago

Download www-users.cs.umn.edu

The goal of this paper is to show that generalizing the notion of support can be useful in extending association analysis to non-traditional types of patterns and non-binary data....

Michael Steinbach, Pang-Ning Tan, Hui Xiong, Vipin...


KDD
2004
ACM

164 views, Data Mining

Ordering patterns by combining opinions from multiple sources

16 years 7 months ago

Download www.cse.msu.edu

Pattern ordering is an important task in data mining because the number of patterns extracted by standard data mining algorithms often exceeds our capacity to manually analyze the...

Pang-Ning Tan, Rong Jin


KDD
2003
ACM

214 views, Data Mining

Adaptive duplicate detection using learnable string similarity measures

16 years 7 months ago

Download www.cs.utexas.edu

The problem of identifying approximately duplicate records in databases is an essential step for data cleaning and data integration processes. Most existing approaches have relied...

Mikhail Bilenko, Raymond J. Mooney
