Unsupervised Transfer Learning in Multilingual Neural Machine   Translation with Cross-Lingual Word Embeddings

Carlos Mullov; Ngoc-Quan Pham; Alexander Waibel

arXiv:2103.06689·cs.CL·March 12, 2021·1 cites

Unsupervised Transfer Learning in Multilingual Neural Machine Translation with Cross-Lingual Word Embeddings

Carlos Mullov, Ngoc-Quan Pham, Alexander Waibel

PDF

Open Access

TL;DR

This paper explores unsupervised methods for adding new languages to multilingual neural machine translation systems using cross-lingual embeddings, achieving competitive translation quality without direct bilingual data.

Contribution

It introduces a novel approach combining cross-lingual embeddings and blind decoding for unsupervised language addition in NMT, with practical adaptation via non-iterative backtranslation.

Findings

01

Blind decoding achieves 36.4 BLEU for Portuguese-English.

02

Autoencoder training yields up to 28 BLEU with noise.

03

Non-iterative backtranslation attains 34.6 BLEU, close to supervised models.

Abstract

In this work we look into adding a new language to a multilingual NMT system in an unsupervised fashion. Under the utilization of pre-trained cross-lingual word embeddings we seek to exploit a language independent multilingual sentence representation to easily generalize to a new language. While using cross-lingual embeddings for word lookup we decode from a yet entirely unseen source language in a process we call blind decoding. Blindly decoding from Portuguese using a basesystem containing several Romance languages we achieve scores of 36.4 BLEU for Portuguese-English and 12.8 BLEU for Russian-English. In an attempt to train the mapping from the encoder sentence representation to a new target language we use our model as an autoencoder. Merely training to translate from Portuguese to Portuguese while freezing the encoder we achieve 26 BLEU on English-Portuguese, and up to 28 BLEU when…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis