Sciweavers

KDD
2005
ACM

Regression error characteristic surfaces

16 years 7 months ago

This paper presents a generalization of Regression Error Characteristic (REC) curves. REC curves describe the cumulative distribution function of the prediction error of models an...

Luís Torgo

KDD
2005
ACM

Mining comparable bilingual text corpora for cross-language information integration

16 years 7 months ago

Download sifaka.cs.uiuc.edu

Integrating information in multiple natural languages is a challenging task that often requires manually created linguistic resources such as a bilingual dictionary or examples of...

Tao Tao, ChengXiang Zhai

KDD
2005
ACM

Email data cleaning

16 years 7 months ago

Download research.microsoft.com

This paper addresses the issue of 'email data cleaning' for text mining. Many text mining applications need to take emails as input. Email data is usually noisy and thus i...

Jie Tang, Hang Li, Yunbo Cao, ZhaoHui Tang

KDD
2005
ACM

A hybrid unsupervised approach for document clustering

16 years 7 months ago

Download www.surdeanu.name

We propose a hybrid, unsupervised document clustering approach that combines a hierarchical clustering algorithm with Expectation Maximization. We developed several heuristics to ...

Mihai Surdeanu, Jordi Turmo, Alicia Ageno

KDD
2005
ACM

Evaluating similarity measures: a large-scale study in the orkut social network

16 years 7 months ago

Download research.google.com

Online information services have grown too large for users to navigate without the help of automated tools such as collaborative filtering, which makes recommendations to users ba...

Ellen Spertus, Mehran Sahami, Orkut Buyukkokten

KDD
2005
ACM

Modeling and predicting personal information dissemination behavior

16 years 7 months ago

Download delivery.acm.org

In this paper, we propose a new way to automatically model and predict human behavior of receiving and disseminating information by analyzing the contact and content of personal c...

Xiaodan Song, Ching-Yung Lin, Belle L. Tseng, Ming...

KDD
2005
ACM

Probabilistic workflow mining

16 years 7 months ago

Download www.cs.cmu.edu

In several organizations, it has become increasingly popular to document and log the steps that make up a typical business process. In some situations, a normative workflow model o...

Ricardo Silva, Jiji Zhang, James G. Shanahan

KDD
2005
ACM

A multinomial clustering model for fast simulation of computer architecture designs

16 years 7 months ago

Download www.ece.neu.edu

Computer architects utilize simulation tools to evaluate the merits of a new design feature. The time needed to adequately evaluate the tradeoffs associated with adding any new fe...

Kaushal Sanghai, Ting Su, Jennifer G. Dy, David R....

KDD
2005
ACM

On the use of linear programming for unsupervised text classification

16 years 7 months ago

Download www.cs.cornell.edu

We propose a new algorithm for dimensionality reduction and unsupervised text classification. We use mixture models as the underlying process for generating the corpus and utilize a novel,...

Mark Sandler

KDD
2005
ACM

Query chains: learning to rank from implicit feedback

16 years 7 months ago

Download www.cs.cornell.edu

This paper presents a novel approach for using clickthrough data to learn ranked retrieval functions for web search results. We observe that users searching the web often perform ...

Filip Radlinski, Thorsten Joachims
