Unsupervised Neural Machine Translation

Mikel Artetxe; Gorka Labaka; Eneko Agirre; Kyunghyun Cho

arXiv:1710.11041·cs.CL·February 27, 2018·97 cites

Unsupervised Neural Machine Translation

Mikel Artetxe, Gorka Labaka, Eneko Agirre, Kyunghyun Cho

PDF

Open Access 2 Repos

TL;DR

This paper introduces a fully unsupervised neural machine translation method that trains on monolingual data alone, achieving competitive translation quality without requiring parallel corpora.

Contribution

It presents a novel unsupervised NMT approach that relies solely on monolingual data, eliminating the need for parallel datasets and simplifying training.

Findings

01

Achieves 15.56 BLEU on French-English translation without parallel data.

02

Attains 10.21 BLEU on German-English translation without parallel data.

03

Improves to 21.81 and 15.24 BLEU when combined with small parallel corpora.

Abstract

In spite of the recent success of neural machine translation (NMT) in standard benchmarks, the lack of large parallel corpora poses a major practical problem for many language pairs. There have been several proposals to alleviate this issue with, for instance, triangulation and semi-supervised learning techniques, but they still require a strong cross-lingual signal. In this work, we completely remove the need of parallel data and propose a novel method to train an NMT system in a completely unsupervised manner, relying on nothing but monolingual corpora. Our model builds upon the recent work on unsupervised embedding mappings, and consists of a slightly modified attentional encoder-decoder model that can be trained on monolingual corpora alone using a combination of denoising and backtranslation. Despite the simplicity of the approach, our system obtains 15.56 and 10.21 BLEU points in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications