Trivial Transfer Learning for Low-Resource Neural Machine Translation

Tom Kocmi; Ond\v{r}ej Bojar

arXiv:1809.00357·cs.CL·July 9, 2020

Trivial Transfer Learning for Low-Resource Neural Machine Translation

Tom Kocmi, Ond\v{r}ej Bojar

PDF

TL;DR

This paper introduces a straightforward transfer learning approach for low-resource neural machine translation, where a high-resource parent model is fine-tuned on a low-resource target, showing significant improvements even across unrelated languages.

Contribution

It demonstrates a simple transfer learning method that outperforms baselines for low-resource translation, applicable across different languages and scripts.

Findings

01

Significant performance gains over baseline models.

02

Effective transfer across unrelated languages.

03

Applicable without language relatedness or special training tricks.

Abstract

Transfer learning has been proven as an effective technique for neural machine translation under low-resource conditions. Existing methods require a common target language, language relatedness, or specific training tricks and regimes. We present a simple transfer learning method, where we first train a "parent" model for a high-resource language pair and then continue the training on a lowresource pair only by replacing the training corpus. This "child" model performs significantly better than the baseline trained for lowresource pair only. We are the first to show this for targeting different languages, and we observe the improvements even for unrelated languages with different alphabets.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.