Sciweavers

5489 search results - page 377 / 1098
» Evaluating evaluation measure stability
EMNLP
2010
15 years 4 months ago
Two Decades of Unsupervised POS Induction: How Far Have We Come?
Part-of-speech (POS) induction is one of the most popular tasks in research on unsupervised NLP. Many different methods have been proposed, yet comparisons are difficult to make s...
Christos Christodoulopoulos, Sharon Goldwater, Mar...
CORR
2011
Springer
15 years 1 months ago
An Empirical Study of Real-World SPARQL Queries
Understanding how users tailor their SPARQL queries is crucial when designing query evaluation engines or fine-tuning RDF stores with performance in mind. In this paper we analyz...
Mario Arias, Javier D. Fernández, Miguel A....
EMMCVPR
2011
Springer
14 years 6 months ago
Optimization of Robust Loss Functions for Weakly-Labeled Image Taxonomies: An ImageNet Case Study
The recently proposed ImageNet dataset consists of several million images, each annotated with a single object category. However, these annotations may be imperfect, in the sense t...
Julian John McAuley, Arnau Ramisa, Tibério ...
SIGIR
2009
ACM
16 years 1 months ago
Has adhoc retrieval improved since 1994?
Evaluation forums such as TREC allow systematic measurement and comparison of information retrieval techniques. The goal is consistent improvement, based on reliable comparison of...
Timothy G. Armstrong, Alistair Moffat, William Web...
SIGIR
2005
ACM
16 years 13 days ago
Do summaries help?
We describe a task-based evaluation to determine whether multi-document summaries measurably improve user performance when using online news browsing systems for directed research...
Kathleen McKeown, Rebecca J. Passonneau, David K. ...