Sciweavers

» A Case Study for Learning from Imbalanced Data Sets
IJCAI
2001
Active Learning for Class Probability Estimation and Ranking
For many supervised learning tasks it is very costly to produce training data with class labels. Active learning acquires data incrementally, at each stage using the model learned...
Maytal Saar-Tsechansky, Foster J. Provost
BMCBI
2006
Predicting protein subcellular locations using hierarchical ensemble of Bayesian classifiers based on Markov chains
Background: The subcellular location of a protein is closely related to its function. It would be worthwhile to develop a method to predict the subcellular location for a given pr...
Alla Bulashevska, Roland Eils
DATAMINE
2006
Mining top-K frequent itemsets from data streams
Frequent pattern mining on data streams is of interest recently. However, it is not easy for users to determine a proper frequency threshold. It is more reasonable to ask users to ...
Raymond Chi-Wing Wong, Ada Wai-Chee Fu
VLDB
2007
ACM
Inferring XML Schema Definitions from XML Data
Although the presence of a schema enables many optimizations for operations on XML documents, recent studies have shown that many XML documents in practice either do not refer to ...
Geert Jan Bex, Frank Neven, Stijn Vansummeren
JCDL
2005
ACM
Automatic extraction of titles from general documents using machine learning
In this paper, we propose a machine learning approach to title extraction from general documents. By general documents, we mean documents that can belong to any one of a number of...
Yunhua Hu, Hang Li, Yunbo Cao, Dmitriy Meyerzon, Q...