Off-the-Shelf Unsupervised NMT

Chris Hokamp; Sebastian Ruder; John Glover

arXiv:1811.02278·cs.CL·November 7, 2018·1 cites

Off-the-Shelf Unsupervised NMT

Chris Hokamp, Sebastian Ruder, John Glover

PDF

Open Access

TL;DR

This paper demonstrates that off-the-shelf neural MT architectures can be effectively adapted for unsupervised translation without parallel data, achieving competitive results and extending to low-resource language pairs like English-Turkish.

Contribution

It introduces a novel approach of using off-the-shelf neural MT models for unsupervised translation, combining multi-task learning insights and enabling application to low-resource languages.

Findings

01

Unsupervised models achieve competitive performance with purpose-built models.

02

The approach extends to low-resource language pairs like English-Turkish.

03

Proposed improvements enhance applicability to truly low-resource settings.

Abstract

We frame unsupervised machine translation (MT) in the context of multi-task learning (MTL), combining insights from both directions. We leverage off-the-shelf neural MT architectures to train unsupervised MT models with no parallel data and show that such models can achieve reasonably good performance, competitive with models purpose-built for unsupervised MT. Finally, we propose improvements that allow us to apply our models to English-Turkish, a truly low-resource language pair.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications