Sciweavers

» Non-redundant data clustering
PAMI
2006
15 years 6 months ago
A Genetic Algorithm Using Hyper-Quadtrees for Low-Dimensional K-means Clustering
The k-means algorithm is widely used for clustering because of its computational efficiency. Given n points in d-dimensional space and the number of desired clusters k, k-means see...
Michael Laszlo, Sumitra Mukherjee
SIGKDD
2000
15 years 6 months ago
Scalability for Clustering Algorithms Revisited
This paper presents a simple new algorithm that performs k-means clustering in one scan of a dataset, while using a buffer for points from the dataset of fixed size. Experiments show...
Fredrik Farnstrom, James Lewis, Charles Elkan
ACL
2009
15 years 4 months ago
Reducing the Annotation Effort for Letter-to-Phoneme Conversion
Letter-to-phoneme (L2P) conversion is the process of producing a correct phoneme sequence for a word, given its letters. It is often desirable to reduce the quantity of training d...
Kenneth Dwyer, Grzegorz Kondrak
TCS
2011
15 years 1 month ago
Two faces of active learning
An active learner has a collection of data points, each with a label that is initially hidden but can be obtained at some cost. Without spending too much, it wishes to find a cla...
Sanjoy Dasgupta
ACL
2011
14 years 10 months ago
Can Document Selection Help Semi-supervised Learning? A Case Study On Event Extraction
Annotating training data for event extraction is tedious and labor-intensive. Most current event extraction tasks rely on hundreds of annotated documents, but this is often not en...
Shasha Liao, Ralph Grishman