Data Diversification: A Simple Strategy For Neural Machine Translation

Xuan-Phi Nguyen; Shafiq Joty; Wu Kui; Ai Ti Aw

arXiv:1911.01986·cs.CL·October 6, 2020·44 cites

Data Diversification: A Simple Strategy For Neural Machine Translation

Xuan-Phi Nguyen, Shafiq Joty, Wu Kui, Ai Ti Aw

PDF

Open Access 2 Repos 1 Video

TL;DR

The paper presents Data Diversification, a straightforward data augmentation technique for neural machine translation that improves performance by merging original data with predictions from multiple models, without extra data or computational costs.

Contribution

It introduces a novel data diversification strategy that enhances NMT performance by leveraging model predictions, outperforming existing methods like knowledge distillation and dual learning.

Findings

01

Achieves state-of-the-art BLEU scores on WMT'14 English-German and English-French tasks.

02

Significantly improves translation quality on multiple low-resource and IWSLT tasks.

03

More effective than knowledge distillation and dual learning methods.

Abstract

We introduce Data Diversification: a simple but effective strategy to boost neural machine translation (NMT) performance. It diversifies the training data by using the predictions of multiple forward and backward models and then merging them with the original dataset on which the final NMT model is trained. Our method is applicable to all NMT models. It does not require extra monolingual data like back-translation, nor does it add more computations and parameters like ensembles of models. Our method achieves state-of-the-art BLEU scores of 30.7 and 43.7 in the WMT'14 English-German and English-French translation tasks, respectively. It also substantially improves on 8 other translation tasks: 4 IWSLT tasks (English-German and English-French) and 4 low-resource translation tasks (English-Nepali and English-Sinhala). We demonstrate that our method is more effective than knowledge…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Data Diversification: A Simple Strategy For Neural Machine Translation· slideslive

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Algorithms and Data Compression

MethodsKnowledge Distillation