Off-the-Shelf Unsupervised NMT
Chris Hokamp, Sebastian Ruder, John Glover

TL;DR
This paper demonstrates that off-the-shelf neural MT architectures can be effectively adapted for unsupervised translation without parallel data, achieving competitive results and extending to low-resource language pairs like English-Turkish.
Contribution
It introduces a novel approach of using off-the-shelf neural MT models for unsupervised translation, combining multi-task learning insights and enabling application to low-resource languages.
Findings
Unsupervised models achieve competitive performance with purpose-built models.
The approach extends to low-resource language pairs like English-Turkish.
Proposed improvements enhance applicability to truly low-resource settings.
Abstract
We frame unsupervised machine translation (MT) in the context of multi-task learning (MTL), combining insights from both directions. We leverage off-the-shelf neural MT architectures to train unsupervised MT models with no parallel data and show that such models can achieve reasonably good performance, competitive with models purpose-built for unsupervised MT. Finally, we propose improvements that allow us to apply our models to English-Turkish, a truly low-resource language pair.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications
