Sciweavers

3241 search results - page 407 / 649
» Challenges for Dataset Search
Sort
View
SIGIR
2009
ACM
16 years 29 days ago
Spam filter evaluation with imprecise ground truth
When trained and evaluated on accurately labeled datasets, online email spam filters are remarkably effective, achieving error rates an order of magnitude better than classifie...
Gordon V. Cormack, Aleksander Kolcz
DASFAA
2008
IEEE
163views Database» more  DASFAA 2008»
16 years 28 days ago
Automated Data Discovery in Similarity Score Queries
A vast amount of information is being stored in scientific databases on the web. The dynamic nature of the scientific data, the cost of providing an up-to-date snapshot of the wh...
Fatih Altiparmak, Ali Saman Tosun, Hakan Ferhatosm...
ICDM
2008
IEEE
129views Data Mining» more  ICDM 2008»
16 years 28 days ago
Sequence Mining Automata: A New Technique for Mining Frequent Sequences under Regular Expressions
In this paper we study the problem of mining frequent sequences satisfying a given regular expression. Previous approaches to solve this problem were focusing on its search space,...
Roberto Trasarti, Francesco Bonchi, Bart Goethals
ICPR
2008
IEEE
16 years 27 days ago
Layered shape matching and registration: Stochastic sampling with hierarchical graph representation
To automatically register foreground target in cluttered images, we present a novel hierarchical graph representation and a stochastic computing strategy in Bayesian framework. Th...
Xiaobai Liu, Liang Lin, Hongwei Li, Hai Jin, Wenbi...
ICDM
2007
IEEE
129views Data Mining» more  ICDM 2007»
16 years 24 days ago
Semi-supervised Clustering Using Bayesian Regularization
Text clustering is most commonly treated as a fully automated task without user supervision. However, we can improve clustering performance using supervision in the form of pairwi...
Zuobing Xu, Ram Akella, Mike Ching, Renjie Tang