Sciweavers

166

KDD
2002
ACM

157views Data Mining» more KDD 2002»

Exploiting unlabeled data in ensemble methods

16 years 7 months ago

An adaptive semi-supervised ensemble method, ASSEMBLE, is proposed that constructs classification ensembles based on both labeled and unlabeled data. ASSEMBLE alternates between a...

Kristin P. Bennett, Ayhan Demiriz, Richard Maclin

claim paper

Read More »

137

click to vote

KDD
2002
ACM

96views Data Mining» more KDD 2002»

A theoretical framework for learning from a pool of disparate data sources

16 years 7 months ago

Download www.cs.cornell.edu

Shai Ben-David, Johannes Gehrke, Reba Schuller

claim paper

Read More »

191

click to vote

KDD
2002
ACM

166views Data Mining» more KDD 2002»

Frequent term-based text clustering

16 years 7 months ago

Download www.cs.sfu.ca

Text clustering methods can be used to structure large sets of text or hypertext documents. The well-known methods of text clustering, however, do not really address the special p...

Florian Beil, Martin Ester, Xiaowei Xu

claim paper

Read More »

183

click to vote

KDD
2002
ACM

115views Data Mining» more KDD 2002»

Collaborative crawling: mining user experiences for topical resource discovery

16 years 7 months ago

Download charuaggarwal.net

The rapid growth of the world wide web had made the problem of topic speci c resource discovery an important one in recent years. In this problem, it is desired to nd web pages wh...

Charu C. Aggarwal

claim paper

Read More »

127

click to vote

KDD
2002
ACM

119views Data Mining» more KDD 2002»

On effective classification of strings with wavelets

16 years 7 months ago

Download www.charuaggarwal.net

In recent years, the technological advances in mapping genes have made it increasingly easy to store and use a wide variety of biological data. Such data are usually in the form o...

Charu C. Aggarwal

claim paper

Read More »

166

click to vote

KDD
2002
ACM

109views Data Mining» more KDD 2002»

Topics in 0--1 data

16 years 7 months ago

Download www.cis.hut.fi

Large 0-1 datasets arise in various applications, such as market basket analysis and information retrieval. We concentrate on the study of topic models, aiming at results which in...

Ella Bingham, Heikki Mannila, Jouni K. Seppän...

claim paper

Read More »

192

click to vote

KDD
2002
ACM

189views Data Mining» more KDD 2002»

Sequential PAttern mining using a bitmap representation

16 years 7 months ago

Download www.cs.cornell.edu

We introduce a new algorithm for mining sequential patterns. Our algorithm is especially efficient when the sequential patterns in the database are very long. We introduce a novel...

Jay Ayres, Jason Flannick, Johannes Gehrke, Tomi Y...

claim paper

Read More »

189

click to vote

KDD
2002
ACM

136views Data Mining» more KDD 2002»

Relational Markov models and their application to adaptive web navigation

16 years 7 months ago

Download www.cs.washington.edu

Relational Markov models (RMMs) are a generalization of Markov models where states can be of different types, with each type described by a different set of variables. The domain ...

Corin R. Anderson, Pedro Domingos, Daniel S. Weld

claim paper

Read More »

198

click to vote

KDD
2002
ACM

147views Data Mining» more KDD 2002»

Visualized Classification of Multiple Sample Types

16 years 7 months ago

Download www.cse.buffalo.edu

The goal of the knowledge discovery and data mining is to extract the useful knowledge from the given data. Visualization enables us to find structures, features, patterns, and re...

Li Zhang, Aidong Zhang, Murali Ramanathan

claim paper

Read More »

141

click to vote

KDD
2002
ACM

107views Data Mining» more KDD 2002»

Clustering and Classifying Enzymes in Metabolic Pathways: Some Preliminary Results

16 years 7 months ago