Multilingual Mix: Example Interpolation Improves Multilingual Neural Machine Translation
Yong Cheng, Ankur Bapna, Orhan Firat, Yuan Cao, Pidong Wang, Wolfgang, Macherey

TL;DR
This paper introduces a novel instance-level interpolation method called mXEncDec that fuses language pairs in multilingual neural machine translation, significantly enhancing translation quality and model generalization across diverse and out-of-distribution multilingual data.
Contribution
The paper proposes the multilingual crossover encoder-decoder (mXEncDec) approach, which interpolates examples from different language pairs to improve shared representations in multilingual NMT models.
Findings
Significant BLEU score improvements on multiple translation tasks.
Enhanced generalization to out-of-distribution multilingual examples.
Better representation sharing across languages.
Abstract
Multilingual neural machine translation models are trained to maximize the likelihood of a mix of examples drawn from multiple language pairs. The dominant inductive bias applied to these models is a shared vocabulary and a shared set of parameters across languages; the inputs and labels corresponding to examples drawn from different language pairs might still reside in distinct sub-spaces. In this paper, we introduce multilingual crossover encoder-decoder (mXEncDec) to fuse language pairs at an instance level. Our approach interpolates instances from different language pairs into joint `crossover examples' in order to encourage sharing input and output spaces across languages. To ensure better fusion of examples in multilingual settings, we propose several techniques to improve example interpolation across dissimilar languages under heavy data imbalance. Experiments on a large-scale…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling
