Understanding and Addressing the Under-Translation Problem from the Perspective of Decoding Objective

Chenze Shao; Fandong Meng; Jiali Zeng; Jie Zhou

arXiv:2405.18922·cs.CL·May 30, 2024

TL;DR

This paper analyzes the root cause of under-translation in neural machine translation from the decoding objective perspective, proposing a confidence-based detection and correction method that improves translation completeness.

Contribution

The paper uses the model's confidence in predicting the End Of Sentence (EOS) token as a detector for under-translation, and strengthens the confidence-based penalty on candidates at high risk of under-translation.

Findings

01 Accurately detects under-translation using EOS confidence

02 Improves translation completeness with minimal impact on correct outputs

03 Effective on both synthetic and real-world datasets

Abstract

Neural Machine Translation (NMT) has made remarkable progress over the past years. However, under-translation and over-translation remain two challenging problems in state-of-the-art NMT systems. In this work, we conduct an in-depth analysis on the underlying cause of under-translation in NMT, providing an explanation from the perspective of decoding objective. To optimize the beam search objective, the model tends to overlook words it is less confident about, leading to the under-translation phenomenon. Correspondingly, the model's confidence in predicting the End Of Sentence (EOS) diminishes when under-translation occurs, serving as a mild penalty for under-translated candidates. Building upon this analysis, we propose employing the confidence of predicting EOS as a detector for under-translation, and strengthening the confidence-based penalty to penalize candidates with a high risk…
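The mechanism described in the abstract can be illustrated with a toy scoring function. This is a minimal sketch and not the authors' implementation: the names `candidate_score`, `looks_under_translated`, `eos_weight`, and `threshold`, as well as all probabilities, are hypothetical and chosen only for illustration. It assumes a candidate is scored by the sum of its token log-probabilities plus the EOS log-probability, and that "strengthening" the penalty means scaling the EOS term by a weight greater than 1.

```python
import math

def candidate_score(token_probs, eos_prob, eos_weight=1.0):
    """Sum of token log-probs plus a weighted EOS log-prob.

    eos_weight=1.0 corresponds to the usual sequence log-probability;
    eos_weight>1.0 is an assumed way of strengthening the EOS penalty.
    """
    token_logprob = sum(math.log(p) for p in token_probs)
    return token_logprob + eos_weight * math.log(eos_prob)

def looks_under_translated(eos_prob, threshold=0.5):
    """Flag a candidate whose EOS confidence falls below a (hypothetical) threshold."""
    return eos_prob < threshold

# Made-up numbers: "full" translates a word the model is unsure about (p=0.3)
# and then ends confidently; "short" skips that word and ends with low EOS confidence.
candidates = {
    "full": {"token_probs": [0.9, 0.8, 0.3, 0.9], "eos_prob": 0.95},
    "short": {"token_probs": [0.9, 0.8, 0.9], "eos_prob": 0.4},
}

def best(candidates, eos_weight):
    """Return the name of the highest-scoring candidate for a given EOS weight."""
    return max(
        candidates,
        key=lambda name: candidate_score(**candidates[name], eos_weight=eos_weight),
    )

if __name__ == "__main__":
    for weight in (1.0, 3.0):
        print(f"eos_weight={weight}: best candidate is {best(candidates, weight)}")
    for name, cand in candidates.items():
        print(name, "flagged:", looks_under_translated(cand["eos_prob"]))
```

With these illustrative numbers, the unweighted objective prefers the shorter candidate because dropping the low-confidence word raises the total score, while the lower EOS confidence acts only as a mild penalty. Increasing the assumed `eos_weight` makes that penalty large enough for the complete candidate to win.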
