Optimizing the Training Schedule of Multilingual NMT using Reinforcement Learning

Alexis Allemann; Àlex R. Atrio; Andrei Popescu-Belis

arXiv:2410.06118·cs.CL·June 3, 2025

TL;DR

This paper uses reinforcement learning to optimize the training schedule of multilingual neural machine translation, i.e. the order in which languages are presented during training, and reports improved translation quality for low-resource languages.

Contribution

The paper proposes applying two reinforcement learning methods, Teacher-Student Curriculum Learning and Deep Q Network, to optimize the training schedule of multilingual NMT, a novel approach in this context.

Findings

01 Deep Q Network improves BLEU and COMET scores

02 Optimized schedules outperform random and shuffled baselines

03 Effective adjustment of language presentation frequency

Abstract

Multilingual NMT is a viable solution for translating low-resource languages (LRLs) when data from high-resource languages (HRLs) from the same language family is available. However, the training schedule, i.e. the order of presentation of languages, has an impact on the quality of such systems. Here, in a many-to-one translation setting, we propose to apply two algorithms that use reinforcement learning to optimize the training schedule of NMT: (1) Teacher-Student Curriculum Learning and (2) Deep Q Network. The former uses an exponentially smoothed estimate of the returns of each action based on the loss on monolingual or multilingual development subsets, while the latter estimates rewards using an additional neural network trained from the history of actions selected in different states of the system, together with the rewards received. On an 8-to-1 translation dataset with LRLs and…
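The first idea in the abstract, an exponentially smoothed estimate of the return of each action, can be illustrated with a minimal sketch. This is not the authors' implementation. The class name `SmoothedScheduler`, the epsilon-greedy selection rule, the values of `alpha` and `epsilon`, and the definition of the reward as the drop in development loss are all assumptions made for illustration.

```python
import random


class SmoothedScheduler:
    """Picks the next training language from exponentially smoothed rewards.

    Hypothetical illustration: each "action" is the choice of one source
    language for the next training step.
    """

    def __init__(self, languages, alpha=0.1, epsilon=0.1, seed=0):
        self.languages = list(languages)
        self.alpha = alpha      # smoothing factor (assumed value)
        self.epsilon = epsilon  # exploration rate (assumed value)
        self.q = {lang: 0.0 for lang in self.languages}
        self.rng = random.Random(seed)

    def select(self):
        """Epsilon-greedy choice: explore at random, else take the best estimate."""
        if self.rng.random() < self.epsilon:
            return self.rng.choice(self.languages)
        return max(self.languages, key=lambda lang: self.q[lang])

    def update(self, lang, reward):
        """Exponential smoothing: q <- alpha * reward + (1 - alpha) * q."""
        self.q[lang] = self.alpha * reward + (1 - self.alpha) * self.q[lang]
        return self.q[lang]


def reward_from_losses(prev_loss, new_loss):
    """Assumed reward: how much the development loss decreased."""
    return prev_loss - new_loss
```

In a training loop, one would call `select()` to choose a language, train on a batch from it, measure the development loss, and pass `reward_from_losses(...)` to `update()`. The Deep Q Network variant described in the abstract replaces the per-language table `q` with a neural network trained on the history of states, actions, and rewards; that part is not sketched here.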

Code & Models

Repositories

alexis-allemann/OpenNMT-py (PyTorch, official)

Taxonomy

Topics: Educational Technology and Assessment