Meta-Learning for Low-Resource Neural Machine Translation

Jiatao Gu; Yong Wang; Yun Chen; Kyunghyun Cho; Victor O.K. Li

arXiv:1808.08437·cs.CL·August 28, 2018·30 cites

Meta-Learning for Low-Resource Neural Machine Translation

Jiatao Gu, Yong Wang, Yun Chen, Kyunghyun Cho, Victor O.K. Li

PDF

Open Access

TL;DR

This paper introduces a meta-learning approach to low-resource neural machine translation, enabling effective translation models with minimal training data by leveraging multilingual high-resource language tasks.

Contribution

It extends the MAML algorithm to NMT, utilizing universal lexical representations for cross-lingual transfer in low-resource settings.

Findings

01

Outperforms transfer learning methods in low-resource NMT

02

Achieves high BLEU scores with limited training data

03

Demonstrates effectiveness across 18 European and 5 diverse languages

Abstract

In this paper, we propose to extend the recently introduced model-agnostic meta-learning algorithm (MAML) for low-resource neural machine translation (NMT). We frame low-resource translation as a meta-learning problem, and we learn to adapt to low-resource languages based on multilingual high-resource language tasks. We use the universal lexical representation~\citep{gu2018universal} to overcome the input-output mismatch across different languages. We evaluate the proposed meta-learning strategy using eighteen European languages (Bg, Cs, Da, De, El, Es, Et, Fr, Hu, It, Lt, Nl, Pl, Pt, Sk, Sl, Sv and Ru) as source tasks and five diverse languages (Ro, Lv, Fi, Tr and Ko) as target tasks. We show that the proposed approach significantly outperforms the multilingual, transfer learning based approach~\citep{zoph2016transfer} and enables us to train a competitive NMT system with only a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications