Sciweavers

5489 search results - page 150 / 1098
» Evaluating evaluation measure stability
Sort
View
INTERACT
2003
15 years 7 months ago
Managing the 'Evaluator Effect' in User Testing
: If multiple evaluators analyse the outcomes of a single user test, the agreement between their lists of identified usability problems tends to be limited. This is called the ‘e...
Arnold P. O. S. Vermeeren, Ilse van Kesteren, Math...
LREC
2010
164views Education» more  LREC 2010»
15 years 8 months ago
Evaluating Machine Translation Utility via Semantic Role Labels
We present the methodology that underlies new metrics for semantic machine translation evaluation that we are developing. Unlike widely-used lexical and n-gram based MT evaluation...
Chi-kiu Lo, Dekai Wu
ICCBR
2010
Springer
15 years 5 months ago
Applying Machine Translation Evaluation Techniques to Textual CBR
The need for automated text evaluation is common to several AI disciplines. In this work, we explore the use of Machine Translation (MT) evaluation metrics for Textual Case Based R...
Ibrahim Adeyanju, Nirmalie Wiratunga, Robert Lothi...
TASLP
2010
108views more  TASLP 2010»
15 years 4 months ago
Exploring Correlation Between ROUGE and Human Evaluation on Meeting Summaries
Abstract—Automatic summarization evaluation is very important to the development of summarization systems. In text summarization, ROUGE has been shown to correlate well with huma...
Feifan Liu, Yang Liu
SIGIR
2006
ACM
16 years 13 days ago
Minimal test collections for retrieval evaluation
Accurate estimation of information retrieval evaluation metrics such as average precision require large sets of relevance judgments. Building sets large enough for evaluation of r...
Ben Carterette, James Allan, Ramesh K. Sitaraman