Attaining the Unattainable? Reassessing Claims of Human Parity in Neural Machine Translation
Antonio Toral, Sheila Castilho, Ke Hu, Andy Way

TL;DR
This paper critically reevaluates claims of human parity in Chinese-English machine translation by considering overlooked variables, revealing that true parity has not been achieved and emphasizing the importance of evaluation conditions.
Contribution
It introduces new variables into human evaluation of MT, such as source text origin and evaluator proficiency, and provides guidelines for more accurate future assessments.
Findings
Human parity not achieved when considering original source texts.
Expert evaluators yield higher agreement and better discrimination.
Identifies key translation issues in current test sets.
Abstract
We reassess a recent study (Hassan et al., 2018) that claimed that machine translation (MT) has reached human parity for the translation of news from Chinese into English, using pairwise ranking and considering three variables that were not taken into account in that previous study: the language in which the source side of the test set was originally written, the translation proficiency of the evaluators, and the provision of inter-sentential context. If we consider only original source text (i.e. not translated from another language, or translationese), then we find evidence showing that human parity has not been achieved. We compare the judgments of professional translators against those of non-experts and discover that those of the experts result in higher inter-annotator agreement and better discrimination between human and machine translations. In addition, we analyse the human…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Explainable Artificial Intelligence (XAI)
