Search Sciweavers | Sciweavers

2840 search results - page 347 / 568

» Learning Phrasal Categories

KDD
2008
ACM

183 views, Data Mining

De-duping URLs via rewrite rules

16 years 7 months ago

Download research.yahoo.com

A large fraction of the URLs on the web contain duplicate (or near-duplicate) content. De-duping URLs is an extremely important problem for search engines, since all the principal...

Anirban Dasgupta, Ravi Kumar, Amit Sasturkar

KDD
2007
ACM

159 views, Data Mining

Local decomposition for rare class analysis

16 years 7 months ago

Download datamining.rutgers.edu

Given its importance, the problem of predicting rare classes in large-scale multi-labeled data sets has attracted great attention in the literature. However, the rare-class probl...

Junjie Wu, Hui Xiong, Peng Wu, Jian Chen

KDD
2007
ACM

154 views, Data Mining

Canonicalization of database records using adaptive similarity measures

16 years 7 months ago

Download www.cs.umass.edu

It is becoming increasingly common to construct databases from information automatically culled from many heterogeneous sources. For example, a research publication database can b...

Aron Culotta, Michael L. Wick, Robert Hall, Matthe...

KDD
2006
ACM

179 views, Data Mining

Extracting key-substring-group features for text classification

16 years 7 months ago

Download www.comp.nus.edu.sg

In many text classification applications, it is appealing to take every document as a string of characters rather than a bag of words. Previous research studies in this area mostl...

Dell Zhang, Wee Sun Lee

KDD
2006
ACM

115 views, Data Mining

Supervised probabilistic principal component analysis

16 years 7 months ago

Download wwwbrauer.informatik.tu-muenchen.de

Principal component analysis (PCA) has been extensively applied in data mining, pattern recognition and information retrieval for unsupervised dimensionality reduction. When label...

Shipeng Yu, Kai Yu, Volker Tresp, Hans-Peter Krieg...
