Decoupled Vocabulary Learning Enables Zero-Shot Translation from Unseen Languages
Carlos Mullov, Ngoc-Quan Pham, Alexander Waibel

TL;DR
This paper introduces a decoupled vocabulary learning approach in multilingual neural machine translation that enables effective zero-shot translation from unseen languages by separating vocabulary learning from syntax training.
Contribution
The authors propose a novel decoupled learning framework that allows zero-shot translation from unseen languages by learning word representations separately and freezing them during translation training.
Findings
Achieved 42.6 BLEU for Portuguese-English zero-shot translation.
Attained 20.7 BLEU for Russian-English zero-shot translation.
Near parity with supervised models in unsupervised translation using iterative back-translation.
Abstract
Multilingual neural machine translation systems learn to map sentences of different languages into a common representation space. Intuitively, with a growing number of seen languages the encoder sentence representation grows more flexible and easily adaptable to new languages. In this work, we test this hypothesis by zero-shot translating from unseen languages. To deal with unknown vocabularies from unknown languages we propose a setup where we decouple learning of vocabulary and syntax, i.e. for each language we learn word representations in a separate step (using cross-lingual word embeddings), and then train to translate while keeping those word representations frozen. We demonstrate that this setup enables zero-shot translation from entirely unseen languages. Zero-shot translating with a model trained on Germanic and Romance languages we achieve scores of 42.6 BLEU for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNatural Language Processing Techniques
