Towards Continual Learning for Multilingual Machine Translation via Vocabulary Substitution

Xavier Garcia; Noah Constant; Ankur P. Parikh; Orhan Firat

arXiv:2103.06799·cs.CL·March 12, 2021

TL;DR

This paper introduces a simple vocabulary adaptation method that extends multilingual machine translation models to new languages, including distant languages with unseen scripts, enabling efficient continual learning with only minor degradation on the original language pairs.

Contribution

The paper presents a vocabulary substitution scheme that scales to large datasets and supports continual learning for multilingual translation, even when only monolingual data is available for the new languages.

Findings

01

Minor degradation on original language pairs

02

Effective for distant languages with unseen scripts

03

Competitive performance with only monolingual data

Abstract

We propose a straightforward vocabulary adaptation scheme to extend the language capacity of multilingual machine translation models, paving the way towards efficient continual learning for multilingual machine translation. Our approach is suitable for large-scale datasets, applies to distant languages with unseen scripts, incurs only minor degradation on the translation performance for the original language pairs and provides competitive performance even in the case where we only possess monolingual data for the new languages.
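
The abstract does not spell out how the vocabulary adaptation works, so the following is only a minimal sketch of one way a vocabulary substitution step could look. It assumes that a new subword vocabulary has already been built to cover both the original and the new languages, that embeddings of tokens present in both vocabularies are carried over, and that embeddings of new tokens are randomly initialized. The function name `substitute_vocabulary`, the `init_scale` parameter, and the toy vocabularies are hypothetical and are not taken from the paper.

```python
import random


def substitute_vocabulary(old_vocab, old_embeddings, new_vocab,
                          seed=0, init_scale=0.02):
    """Build an embedding table for new_vocab (illustrative sketch).

    Tokens that also occur in old_vocab keep their trained embedding;
    all other tokens get a small random Gaussian initialization.
    Returns the new table and the number of reused rows.
    """
    rng = random.Random(seed)
    old_index = {token: i for i, token in enumerate(old_vocab)}
    dim = len(old_embeddings[0])

    new_embeddings = []
    reused = 0
    for token in new_vocab:
        i = old_index.get(token)
        if i is not None:
            # Overlapping token: copy the trained row.
            new_embeddings.append(list(old_embeddings[i]))
            reused += 1
        else:
            # Unseen token (e.g. from a new script): random init (assumption).
            new_embeddings.append(
                [rng.gauss(0.0, init_scale) for _ in range(dim)]
            )
    return new_embeddings, reused


# Toy example: the new vocabulary drops one old token and adds two new ones.
old_vocab = ["<pad>", "▁the", "▁cat", "ing"]
old_embeddings = [[0.0, 0.0], [0.1, 0.2], [0.3, 0.4], [0.5, 0.6]]
new_vocab = ["<pad>", "▁the", "ing", "▁кот", "▁и"]

new_embeddings, reused = substitute_vocabulary(
    old_vocab, old_embeddings, new_vocab
)
```

In this sketch the rest of the model would be left unchanged and training would continue on data that includes the new languages. Whether this matches the authors' exact procedure should be checked against the full paper.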
