Attaining the Unattainable? Reassessing Claims of Human Parity in Neural Machine Translation

Antonio Toral; Sheila Castilho; Ke Hu; Andy Way

arXiv:1808.10432 · cs.CL · August 31, 2018 · 5 cites

TL;DR

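This paper critically reevaluates the claim of human parity in Chinese-to-English news translation by considering three variables the earlier study overlooked: the original language of the source text, the translation proficiency of the evaluators, and the provision of inter-sentential context. It finds evidence that parity has not been achieved on originally written source text and shows that evaluation conditions affect the outcome.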

Contribution

It introduces new variables into human evaluation of MT, such as source text origin and evaluator proficiency, and provides guidelines for more accurate future assessments.

Findings

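1. Human parity is not achieved when only original source text (i.e. not translationese) is considered.

2. Professional translators yield higher inter-annotator agreement and discriminate better between human and machine translations than non-experts.

3. The analysis identifies key translation issues in current test sets.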

Abstract

We reassess a recent study (Hassan et al., 2018) that claimed that machine translation (MT) has reached human parity for the translation of news from Chinese into English, using pairwise ranking and considering three variables that were not taken into account in that previous study: the language in which the source side of the test set was originally written, the translation proficiency of the evaluators, and the provision of inter-sentential context. If we consider only original source text (i.e. not translated from another language, or translationese), then we find evidence showing that human parity has not been achieved. We compare the judgments of professional translators against those of non-experts and discover that those of the experts result in higher inter-annotator agreement and better discrimination between human and machine translations. In addition, we analyse the human…
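
The abstract compares inter-annotator agreement between expert and non-expert evaluators on pairwise rankings. As a minimal sketch of how such agreement can be quantified, the snippet below computes Cohen's kappa for two annotators. The choice of Cohen's kappa, the label set (`"HT"`, `"MT"`, `"tie"`), and the example judgments are all assumptions made for illustration; they are not taken from the paper.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators labelling the same items."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("need two equal-length, non-empty label lists")
    n = len(labels_a)
    # Observed agreement: fraction of items given the same label.
    p_o = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement, from each annotator's own label distribution.
    freq_a = Counter(labels_a)
    freq_b = Counter(labels_b)
    p_e = sum(freq_a[k] * freq_b[k] for k in freq_a) / (n * n)
    if p_e == 1.0:
        return 1.0  # both annotators used one identical label throughout
    return (p_o - p_e) / (1 - p_e)


# Hypothetical pairwise judgments: which translation each annotator preferred
# for the same six source sentences (illustrative data, not from the paper).
annotator_1 = ["HT", "MT", "HT", "tie", "HT", "MT"]
annotator_2 = ["HT", "MT", "MT", "tie", "HT", "HT"]
kappa = cohen_kappa(annotator_1, annotator_2)
print(f"kappa = {kappa:.3f}")
```

Under this measure, a higher kappa within one evaluator group than within another would indicate that the first group ranks the same translation pairs more consistently, which is the kind of comparison the abstract describes.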

Code & Models

Repositories

antot/human_parity_mt
Official

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Explainable Artificial Intelligence (XAI)