Measuring Uncertainty in Translation Quality Evaluation (TQE)
Serge Gladkoff, Irina Sorokina, Lifeng Han, Alexandra Alekseeva

TL;DR
This paper investigates how to efficiently and reliably estimate translation quality by determining the optimal sample size for human and automated evaluations, using statistical models to improve TQE accuracy.
Contribution
It introduces a statistical framework that uses Bernoulli Statistical Distribution Modelling (BSDM) and Monte Carlo Sampling Analysis (MCSA) to determine the sample size needed for reliable translation quality evaluation.
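A minimal sketch of the idea, not the authors' implementation: assume each sampled segment is independently either erroneous or correct (a Bernoulli trial), then use Monte Carlo sampling to see how much the estimated error rate varies from one sample to the next. The function names, the 10% true error rate, and the sample sizes are hypothetical choices made for illustration only.

```python
import random
import statistics


def simulate_estimates(true_error_rate, sample_size, trials=1000, seed=0):
    """Monte Carlo: repeatedly draw a sample of segments and record the
    observed error rate. Each segment is a Bernoulli trial (assumption)."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(trials):
        errors = sum(
            1 for _ in range(sample_size) if rng.random() < true_error_rate
        )
        estimates.append(errors / sample_size)
    return estimates


def spread(true_error_rate, sample_size, trials=1000, seed=0):
    """Standard deviation of the simulated error-rate estimates."""
    return statistics.pstdev(
        simulate_estimates(true_error_rate, sample_size, trials, seed)
    )


if __name__ == "__main__":
    # Hypothetical true error rate of 10%; larger samples give tighter estimates.
    for n in (50, 200, 1000):
        print(n, round(spread(0.1, n), 4))
```

Under these assumptions the spread should shrink roughly in proportion to one over the square root of the sample size, which is why a small evaluation sample can give an unreliable quality score even when the evaluators themselves are consistent.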
Findings
Optimal sample size depends on desired confidence levels and evaluation variability.
Statistical models can improve the reliability of human and machine translation assessments.
Methodology reduces evaluation costs while maintaining quality assessment accuracy.
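The dependence of sample size on confidence level and variability can be illustrated with the textbook normal-approximation formula for a proportion, n = z² · p(1 − p) / E². This is a standard approximation swapped in for illustration, not necessarily the calculation used in the paper; the function name, the unit (sampled segments), and the parameter values are assumptions.

```python
import math
from statistics import NormalDist


def required_sample_size(confidence, margin, p=0.5):
    """Segments needed so the estimated error rate is within `margin` of the
    true rate at the given two-sided confidence level.

    Normal approximation for a Bernoulli proportion. p is the anticipated
    error rate; p = 0.5 is the most conservative (highest-variance) choice.
    """
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    return math.ceil(z * z * p * (1 - p) / (margin * margin))


if __name__ == "__main__":
    print(required_sample_size(0.95, 0.05))         # worst-case variability
    print(required_sample_size(0.99, 0.05))         # higher confidence
    print(required_sample_size(0.95, 0.05, p=0.1))  # lower variability
```

Raising the confidence level or tightening the margin increases the required sample, while a lower anticipated variability reduces it. The approximation is known to be weak for very small samples or error rates near 0 or 1, where an interval such as Wilson's is usually preferred.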
Abstract
From the point of view of both human translators (HT) and machine translation (MT) researchers, translation quality evaluation (TQE) is an essential task. Translation service providers (TSPs) have to deliver large volumes of translations that meet customer specifications under strict constraints on required quality level, time frames, and costs. MT researchers strive to make their models better, which also requires reliable quality evaluation. While automatic machine translation evaluation (MTE) metrics and quality estimation (QE) tools are widely available and easy to access, existing automated tools are not good enough, and human assessment from professional translators (HAP) is often chosen as the gold standard \cite{han-etal-2021-TQA}. Human evaluations, however, are often accused of having low reliability and agreement. Is this caused by subjectivity, or is statistics at play?…
Taxonomy
Topics: Natural Language Processing Techniques · Topic Modeling · Data Quality and Management
