Learning Policies for Multilingual Training of Neural Machine   Translation Systems

Gaurav Kumar; Philipp Koehn; Sanjeev Khudanpur

arXiv:2103.06964·cs.CL·March 15, 2021

Learning Policies for Multilingual Training of Neural Machine Translation Systems

Gaurav Kumar, Philipp Koehn, Sanjeev Khudanpur

PDF

Open Access

TL;DR

This paper introduces search-based and learned curricula for multilingual neural machine translation, enhancing low-resource translation performance by optimizing training data orderings and jointly learning curricula with the translation model.

Contribution

It proposes simple search-based curricula and a novel method to learn curricula from scratch using multi-arm bandits, improving low-resource MNMT performance.

Findings

01

Learned curricula provide better fine-tuning starting points.

02

Curricula improve overall translation accuracy on FLORES dataset.

03

Joint curriculum learning enhances low-resource translation quality.

Abstract

Low-resource Multilingual Neural Machine Translation (MNMT) is typically tasked with improving the translation performance on one or more language pairs with the aid of high-resource language pairs. In this paper, we propose two simple search based curricula -- orderings of the multilingual training data -- which help improve translation performance in conjunction with existing techniques such as fine-tuning. Additionally, we attempt to learn a curriculum for MNMT from scratch jointly with the training of the translation system with the aid of contextual multi-arm bandits. We show on the FLORES low-resource translation dataset that these learned curricula can provide better starting points for fine tuning and improve overall performance of the translation system.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications