Sciweavers

2161 search results - page 225 / 433
» Learning to classify e-mail
Sort
View
IFIP12
2008
15 years 8 months ago
A Study with Class Imbalance and Random Sampling for a Decision Tree Learning System
Sampling methods are a direct approach to tackle the problem of class imbalance. These methods sample a data set in order to alter the class distributions. Usually these methods ar...
Ronaldo C. Prati, Gustavo E. A. P. A. Batista, Mar...
EMNLP
2006
15 years 8 months ago
Learning Information Status of Discourse Entities
In this paper we address the issue of automatically assigning information status to discourse entities. Using an annotated corpus of conversational English and exploiting morpho-s...
Malvina Nissim
IDA
2006
Springer
15 years 6 months ago
Classification of symbolic objects: A lazy learning approach
Symbolic data analysis aims at generalizing some standard statistical data mining methods, such as those developed for classification tasks, to the case of symbolic objects (SOs). ...
Annalisa Appice, Claudia d'Amato, Floriana Esposit...
ICML
2008
IEEE
16 years 7 months ago
Cost-sensitive multi-class classification from probability estimates
For two-class classification, it is common to classify by setting a threshold on class probability estimates, where the threshold is determined by ROC curve analysis. An analog fo...
Deirdre B. O'Brien, Maya R. Gupta, Robert M. Gray
KDD
2005
ACM
99views Data Mining» more  KDD 2005»
16 years 7 months ago
Determining an author's native language by mining a text for errors
In this paper, we show that stylistic text features can be exploited to determine an anonymous author's native language with high accuracy. Specifically, we first use automat...
Moshe Koppel, Jonathan Schler, Kfir Zigdon