Sciweavers

479 search results - page 36 / 96
» Statistical Significance Tests for Machine Translation Evalu...
Sort
View
ACL
1997
15 years 7 months ago
Probing the Lexicon in Evaluating Commercial MT Systems
In the past the evaluation of machine translation systems has focused on single system evaluations because there were only few systems available. But now there are several commerc...
Martin Volk
ICML
2001
IEEE
16 years 7 months ago
Direct Policy Search using Paired Statistical Tests
Direct policy search is a practical way to solve reinforcement learning problems involving continuous state and action spaces. The goal becomes finding policy parameters that maxi...
Malcolm J. A. Strens, Andrew W. Moore
JETAI
2010
56views more  JETAI 2010»
15 years 4 months ago
Warning: statistical benchmarking is addictive. Kicking the habit in machine learning
Algorithm performance evaluation is so entrenched in the Machine Learning community that one could call it an addiction. Like most addictions, it is harmful and very difficult to ...
Chris Drummond, Nathalie Japkowicz
LREC
2008
104views Education» more  LREC 2008»
15 years 7 months ago
Performance Evaluation of Speech Translation Systems
One of the most challenging tasks for uniformed service personnel serving in foreign countries is effective verbal communication with the local population. To remedy this problem,...
Brian A. Weiss, Craig Schlenoff, Greg Sanders, Mic...
COLING
1996
15 years 7 months ago
Towards a More Careful Evaluation of Broad Coverage Parsing Systems
Since treebanks have become available to researchers a wide variety of techniques has been used to make broad coverage parsing systems. This makes quantitative evaluation very imp...
Wide R. Hogenhout, Yuji Matsumoto