Search Sciweavers | Sciweavers

2444 search results - page 288 / 489

» A Pattern Based Data Mining Approach

KDD
2006
ACM

Reducing the human overhead in text categorization

16 years 7 months ago

Download research.microsoft.com

Many applications in text processing require significant human effort for either labeling large document collections (when learning statistical models) or extrapolating rules from...

Arnd Christian König, Eric Brill

VLDB
2005
ACM

FiST: Scalable XML Document Filtering by Sequencing Twig Patterns

16 years 17 hours ago

Download vldb.idi.ntnu.no

In recent years, publish-subscribe (pub-sub) systems based on XML document filtering have received much attention. In a typical pub-sub system, subscribed users specify their inte...

Joonho Kwon, Praveen Rao, Bongki Moon, Sukho Lee

ISCI
2008

A discretization algorithm based on Class-Attribute Contingency Coefficient

15 years 6 months ago

Download sci2s.ugr.es

Discretization algorithms have played an important role in data mining and knowledge discovery. They not only produce a concise summarization of continuous attributes to help the ...

Cheng-Jung Tsai, Chien-I Lee, Wei-Pang Yang

SADM
2008

Global Correlation Clustering Based on the Hough Transform

15 years 6 months ago

Download www.dbs.ifi.lmu.de

In this article, we propose an efficient and effective method for finding arbitrarily oriented subspace clusters by mapping the data space to a parameter space defining the set o...

Elke Achtert, Christian Böhm, Jörn David...

KDD
2003
ACM

Adaptive duplicate detection using learnable string similarity measures

16 years 7 months ago

Download www.cs.utexas.edu

The problem of identifying approximately duplicate records in databases is an essential step for data cleaning and data integration processes. Most existing approaches have relied...

Mikhail Bilenko, Raymond J. Mooney
