Rethink about the Word-level Quality Estimation for Machine Translation   from Human Judgement

Zhen Yang; Fandong Meng; Yuanmeng Yan; Jie Zhou

arXiv:2209.05695·cs.CL·September 14, 2022

Rethink about the Word-level Quality Estimation for Machine Translation from Human Judgement

Zhen Yang, Fandong Meng, Yuanmeng Yan, Jie Zhou

PDF

Open Access 1 Repo

TL;DR

This paper introduces a human-annotated benchmark dataset for word-level quality estimation in machine translation, addressing limitations of automatic metrics by incorporating expert human judgments and proposing self-supervised strategies to improve artificial data quality.

Contribution

The paper presents HJQE, a human-annotated dataset for word-level QE, and proposes self-supervised tag correction strategies to align artificial QE data with human judgments.

Findings

01

HJQE dataset aligns better with human judgment than existing metrics.

02

Self-supervised tag correction strategies improve artificial QE data quality.

03

Experimental results validate the effectiveness of the proposed methods.

Abstract

Word-level Quality Estimation (QE) of Machine Translation (MT) aims to find out potential translation errors in the translated sentence without reference. Typically, conventional works on word-level QE are designed to predict the translation quality in terms of the post-editing effort, where the word labels ("OK" and "BAD") are automatically generated by comparing words between MT sentences and the post-edited sentences through a Translation Error Rate (TER) toolkit. While the post-editing effort can be used to measure the translation quality to some extent, we find it usually conflicts with the human judgement on whether the word is well or poorly translated. To overcome the limitation, we first create a golden benchmark dataset, namely \emph{HJQE} (Human Judgement on Quality Estimation), where the expert translators directly annotate the poorly translated words on their judgements.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zhenyangiacas/hjqe
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification