Sciweavers

17688 search results - page 196 / 3538
» Data Set Balancing
CICLING
2006
Springer
15 years 10 months ago
Improving kNN Text Categorization by Removing Outliers from Training Set
We show that excluding outliers from the training data significantly improves the kNN classifier, which in this case performs about 10% better than the best known method--Centroid-based...
Kwangcheol Shin, Ajith Abraham, Sang-Yong Han
BMCBI
2004
15 years 6 months ago
PhyME: A probabilistic algorithm for finding motifs in sets of orthologous sequences
Background: This paper addresses the problem of discovering transcription factor binding sites in heterogeneous sequence data, which includes regulatory sequences of one or more g...
Saurabh Sinha, Mathieu Blanchette, Martin Tompa
ICCV
2001
IEEE
16 years 8 months ago
Feature Selection from Huge Feature Sets
The number of features that can be computed over an image is, for practical purposes, limitless. Unfortunately, the number of features that can be computed and exploited by most c...
José Bins, Bruce A. Draper
CORR
2010
Springer
15 years 6 months ago
CONCISE: Compressed 'n' Composable Integer Set
Bit arrays, or bitmaps, are used to significantly speed up set operations in several areas, such as data warehousing, information retrieval, and data mining, to cite a few. Howeve...
Alessandro Colantonio, Roberto Di Pietro
ICMCS
2009
IEEE
15 years 4 months ago
Acoustic modeling using an extended phone set considering cross-lingual pronunciation variations
To address the data imbalance in multilingual speech recognition and the pronunciation variations across languages, we propose an ap...
Dau-Cheng Lyu, Ren-Yuan Lyu, Ming-Tat Ko