Exploiting Curriculum Learning in Unsupervised Neural Machine   Translation

Jinliang Lu; Jiajun Zhang

arXiv:2109.11177·cs.CL·September 24, 2021

Exploiting Curriculum Learning in Unsupervised Neural Machine Translation

Jinliang Lu, Jiajun Zhang

PDF

Open Access 1 Repo

TL;DR

This paper introduces a curriculum learning approach for unsupervised neural machine translation that leverages pseudo data quality assessment to improve translation accuracy and training efficiency.

Contribution

It proposes a novel curriculum learning framework that uses cross-lingual embeddings and model-based quality scores to prioritize pseudo data during training.

Findings

01

Achieves consistent improvements on multiple translation benchmarks.

02

Faster convergence compared to standard unsupervised NMT.

03

Enhances translation quality by focusing on higher-quality pseudo data.

Abstract

Back-translation (BT) has become one of the de facto components in unsupervised neural machine translation (UNMT), and it explicitly makes UNMT have translation ability. However, all the pseudo bi-texts generated by BT are treated equally as clean data during optimization without considering the quality diversity, leading to slow convergence and limited translation performance. To address this problem, we propose a curriculum learning method to gradually utilize pseudo bi-texts based on their quality from multiple granularities. Specifically, we first apply cross-lingual word embedding to calculate the potential translation difficulty (quality) for the monolingual sentences. Then, the sentences are fed into UNMT from easy to hard batch by batch. Furthermore, considering the quality of sentences/tokens in a particular batch are also diverse, we further adopt the model itself to calculate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jinlianglu96/cl_unmt
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications