Self-Paced Learning for Neural Machine Translation

Yu Wan; Baosong Yang; Derek F. Wong; Yikai Zhou; Lidia S. Chao; Haibo; Zhang; Boxing Chen

arXiv:2010.04505·cs.CL·October 20, 2022

Self-Paced Learning for Neural Machine Translation

Yu Wan, Baosong Yang, Derek F. Wong, Yikai Zhou, Lidia S. Chao, Haibo, Zhang, Boxing Chen

PDF

Open Access 1 Repo

TL;DR

This paper introduces a self-paced learning approach for neural machine translation that automatically assesses training example difficulty and adjusts learning, resulting in improved translation quality and faster convergence compared to traditional curriculum methods.

Contribution

It proposes a novel self-paced learning framework for NMT that eliminates reliance on handcrafted curricula by enabling the model to self-regulate learning based on confidence measures.

Findings

01

Outperforms strong baselines in translation quality

02

Achieves faster convergence during training

03

Demonstrates robustness across multiple translation tasks

Abstract

Recent studies have proven that the training of neural machine translation (NMT) can be facilitated by mimicking the learning process of humans. Nevertheless, achievements of such kind of curriculum learning rely on the quality of artificial schedule drawn up with the handcrafted features, e.g. sentence length or word rarity. We ameliorate this procedure with a more flexible manner by proposing self-paced learning, where NMT model is allowed to 1) automatically quantify the learning confidence over training examples; and 2) flexibly govern its learning via regulating the loss in each iteration step. Experimental results over multiple translation tasks demonstrate that the proposed model yields better performance than strong baselines and those models trained with human-designed curricula on both translation quality and convergence speed.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wanyu2018umac/Self-Paced-Learning-for-Neural-Machine-Translation
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification