Sciweavers

6279 search results - page 350 / 1256
» Studies in Solution Sampling
Sort
View
LREC
2010
176views Education» more  LREC 2010»
15 years 8 months ago
There's no Data like More Data? Revisiting the Impact of Data Size on a Classification Task
In the paper we investigate the impact of data size on a Word Sense Disambiguation task (WSD). We question the assumption that the knowledge acquisition bottleneck, which is known...
Ines Rehbein, Josef Ruppenhofer
LREC
2010
178views Education» more  LREC 2010»
15 years 8 months ago
Like Finding a Needle in a Haystack: Annotating the American National Corpus for Idiomatic Expressions
This paper presents the details of a pilot study in which we tagged portions of the American National Corpus (ANC) for idioms composed of verb-noun constructions, prepositional ph...
Laura Street, Nathan Michalov, Rachel Silverstein,...
SODA
2008
ACM
200views Algorithms» more  SODA 2008»
15 years 8 months ago
Clustering for metric and non-metric distance measures
We study a generalization of the k-median problem with respect to an arbitrary dissimilarity measure D. Given a finite set P, our goal is to find a set C of size k such that the s...
Marcel R. Ackermann, Johannes Blömer, Christi...
SDM
2007
SIAM
107views Data Mining» more  SDM 2007»
15 years 8 months ago
On Demand Phenotype Ranking through Subspace Clustering
High throughput biotechnologies have enabled scientists to collect a large number of genetic and phenotypic attributes for a large collection of samples. Computational methods are...
Xiang Zhang, Wei Wang 0010, Jun Huan
ISMB
2001
15 years 8 months ago
Molecular classification of multiple tumor types
Using gene expression data to classify tumor types is a very promising tool in cancer diagnosis. Previous works show several pairs of tumor types can be successfully distinguished...
Chen-Hsiang Yeang, Sridhar Ramaswamy, Pablo Tamayo...