Modeling Coverage for Non-Autoregressive Neural Machine Translation

Yong Shan; Yang Feng; Chenze Shao

arXiv:2104.11897·cs.CL·April 27, 2021

Modeling Coverage for Non-Autoregressive Neural Machine Translation

Yong Shan, Yang Feng, Chenze Shao

PDF

Open Access 1 Datasets

TL;DR

This paper introduces a novel coverage modeling approach for Non-Autoregressive Neural Machine Translation to reduce translation errors and improve quality by tracking token coverage during translation.

Contribution

It proposes a coverage-based NAT model with token-level refinement and sentence-level agreement, addressing over- and under-translation issues.

Findings

01

Significant reduction in translation errors.

02

Improved translation quality on WMT datasets.

03

Outperforms baseline NAT systems.

Abstract

Non-Autoregressive Neural Machine Translation (NAT) has achieved significant inference speedup by generating all tokens simultaneously. Despite its high efficiency, NAT usually suffers from two kinds of translation errors: over-translation (e.g. repeated tokens) and under-translation (e.g. missing translations), which eventually limits the translation quality. In this paper, we argue that these issues of NAT can be addressed through coverage modeling, which has been proved to be useful in autoregressive decoding. We propose a novel Coverage-NAT to model the coverage information directly by a token-level coverage iterative refinement mechanism and a sentence-level coverage agreement, which can remind the model if a source token has been translated or not and improve the semantics consistency between the translation and the source, respectively. Experimental results on WMT14 En-De and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

Kylan12/Synthetic-AI-ML-Dataset
dataset· 42 dl
42 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications