Sciweavers

3716 search results - page 259 / 744
» On the monotonization of the training set
Sort
View
UAI
2008
15 years 8 months ago
Small Sample Inference for Generalization Error in Classification Using the CUD Bound
Confidence measures for the generalization error are crucial when small training samples are used to construct classifiers. A common approach is to estimate the generalization err...
Eric Laber, Susan Murphy
ICDE
2009
IEEE
210views Database» more  ICDE 2009»
16 years 8 months ago
Keyword Search in Spatial Databases: Towards Searching by Document
This work addresses a novel spatial keyword query called the m-closest keywords (mCK) query. Given a database of spatial objects, each tuple is associated with some descriptive inf...
Dongxiang Zhang, Yeow Meng Chee, Anirban Mondal, A...
ICDE
2007
IEEE
115views Database» more  ICDE 2007»
16 years 8 months ago
Preservation Of Patterns and Input-Output Privacy
Abstract breaches. To do so, the data custodian needs to transform its data. To determine the appropriate transforPrivacy preserving data mining so far has mainly mation, there are...
Shaofeng Bu, Laks V. S. Lakshmanan, Raymond T. Ng,...
KDD
2002
ACM
170views Data Mining» more  KDD 2002»
16 years 7 months ago
Enhanced word clustering for hierarchical text classification
In this paper we propose a new information-theoretic divisive algorithm for word clustering applied to text classification. In previous work, such "distributional clustering&...
Inderjit S. Dhillon, Subramanyam Mallela, Rahul Ku...
ICML
2009
IEEE
16 years 1 months ago
Non-monotonic feature selection
We consider the problem of selecting a subset of m most informative features where m is the number of required features. This feature selection problem is essentially a combinator...
Zenglin Xu, Rong Jin, Jieping Ye, Michael R. Lyu, ...