Enhanced back-translation for low resource neural machine translation   using self-training

Idris Abdulmumin; Bashir Shehu Galadanci; Abubakar Isa

arXiv:2006.02876·cs.CL·February 16, 2021

Enhanced back-translation for low resource neural machine translation using self-training

Idris Abdulmumin, Bashir Shehu Galadanci, Abubakar Isa

PDF

TL;DR

This paper introduces a self-training method to enhance back-translation quality in low-resource neural machine translation, leading to improved translation performance by iteratively refining the backward model.

Contribution

It proposes a novel self-training strategy that uses the model's own outputs to improve back-translation quality in low-resource NMT settings.

Findings

01

Improved backward models increased BLEU scores by 11.06 and 1.5 on low-resource datasets.

02

Synthetic data from enhanced backward models led to 2.7 BLEU gain in forward translation.

03

Method outperforms standard back-translation in low-resource neural machine translation.

Abstract

Improving neural machine translation (NMT) models using the back-translations of the monolingual target data (synthetic parallel data) is currently the state-of-the-art approach for training improved translation systems. The quality of the backward system - which is trained on the available parallel data and used for the back-translation - has been shown in many studies to affect the performance of the final NMT model. In low resource conditions, the available parallel data is usually not enough to train a backward model that can produce the qualitative synthetic data needed to train a standard translation model. This work proposes a self-training strategy where the output of the backward model is used to improve the model itself through the forward translation technique. The technique was shown to improve baseline low resource IWSLT'14 English-German and IWSLT'15 English-Vietnamese…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.