Trivial Transfer Learning for Low-Resource Neural Machine Translation
Tom Kocmi, Ond\v{r}ej Bojar

TL;DR
This paper introduces a straightforward transfer learning approach for low-resource neural machine translation, where a high-resource parent model is fine-tuned on a low-resource target, showing significant improvements even across unrelated languages.
Contribution
It demonstrates a simple transfer learning method that outperforms baselines for low-resource translation, applicable across different languages and scripts.
Findings
Significant performance gains over baseline models.
Effective transfer across unrelated languages.
Applicable without language relatedness or special training tricks.
Abstract
Transfer learning has been proven as an effective technique for neural machine translation under low-resource conditions. Existing methods require a common target language, language relatedness, or specific training tricks and regimes. We present a simple transfer learning method, where we first train a "parent" model for a high-resource language pair and then continue the training on a lowresource pair only by replacing the training corpus. This "child" model performs significantly better than the baseline trained for lowresource pair only. We are the first to show this for targeting different languages, and we observe the improvements even for unrelated languages with different alphabets.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
