Sciweavers

3716 search results - page 279 / 744
» On the monotonization of the training set
Sort
View
ICTAI
2007
IEEE
16 years 29 days ago
Automatic Personalized Spam Filtering through Significant Word Modeling
Typically, spam filters are built on the assumption that the characteristics of e-mails in the training set is identical to those in individual users’ inboxes on which it will b...
Khurum Nazir Junejo, Asim Karim
CIKM
2007
Springer
16 years 25 days ago
Developing learning strategies for topic-based summarization
Most up-to-date well-behaved topic-based summarization systems are built upon the extractive framework. They score the sentences based on the associated features by manually assig...
Ouyang You, Sujian Li, Wenjie Li
ECML
2007
Springer
16 years 24 days ago
Avoiding Boosting Overfitting by Removing Confusing Samples
Boosting methods are known to exhibit noticeable overfitting on some datasets, while being immune to overfitting on other ones. In this paper we show that standard boosting algorit...
Alexander Vezhnevets, Olga Barinova
ICDM
2006
IEEE
76views Data Mining» more  ICDM 2006»
16 years 21 days ago
A Probabilistic Ensemble Pruning Algorithm
An ensemble is a group of learners that work together as a committee to solve a problem. However, the existing ensemble training algorithms sometimes generate unnecessary large en...
Huanhuan Chen, Peter Tiño, Xin Yao
ICMCS
2006
IEEE
192views Multimedia» more  ICMCS 2006»
16 years 20 days ago
Classifier Optimization for Multimedia Semantic Concept Detection
In this paper, we present an AUC (i.e., the Area Under the Curve of Receiver Operating Characteristics (ROC)) maximization based learning algorithm to design the classifier for ma...
Sheng Gao, Qibin Sun