Search Sciweavers | Sciweavers

2193 search results - page 331 / 439

» Properties of Support Vector Machines

135

click to vote

WWW
2007
ACM

154views Internet Technology» more WWW 2007»

A clustering method for web data with multi-type interrelated components

16 years 7 months ago

Download www2007.org

Traditional clustering algorithms work on "flat" data, making the assumption that the data instances can only be represented by a set of homogeneous and uniform features...

Levent Bolelli, Seyda Ertekin, Ding Zhou, C. Lee G...

claim paper

Read More »

156

click to vote

WWW
2005
ACM

144views Internet Technology» more WWW 2005»

An experimental study on large-scale web categorization

16 years 7 months ago

Download www2005.org

Taxonomies of the Web typically have hundreds of thousands of categories and skewed category distribution over documents. It is not clear whether existing text classification tech...

Tie-Yan Liu, Yiming Yang, Hao Wan, Qian Zhou, Bin ...

claim paper

Read More »

155

click to vote

KDD
2008
ACM

167views Data Mining» more KDD 2008»

A sequential dual method for large scale multi-class linear svms

16 years 6 months ago

Download www.csie.ntu.edu.tw

Efficient training of direct multi-class formulations of linear Support Vector Machines is very useful in applications such as text classification with a huge number examples as w...

S. Sathiya Keerthi, S. Sundararajan, Kai-Wei Chang...

claim paper

Read More »

173

click to vote

KDD
2005
ACM

118views Data Mining» more KDD 2005»

On the use of linear programming for unsupervised text classification

16 years 6 months ago

Download www.cs.cornell.edu

We propose a new algorithm for dimensionality reduction and unsupervised text classification. We use mixture models as underlying process of generating corpus and utilize a novel,...

Mark Sandler

claim paper

Read More »

181

click to vote

KDD
2003
ACM

214views Data Mining» more KDD 2003»

Adaptive duplicate detection using learnable string similarity measures

16 years 6 months ago

Download www.cs.utexas.edu

The problem of identifying approximately duplicate records in databases is an essential step for data cleaning and data integration processes. Most existing approaches have relied...

Mikhail Bilenko, Raymond J. Mooney

claim paper

Read More »

« Prev « First page 331 / 439 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers