Curriculum Learning for Domain Adaptation in Neural Machine Translation

Xuan Zhang; Pamela Shapiro; Gaurav Kumar; Paul McNamee; Marine Carpuat; Kevin Duh

arXiv:1905.05816 · cs.CL · May 16, 2019 · 6 cites

TL;DR

This paper presents a curriculum learning method for adapting generic neural machine translation models to a specific domain: training samples are grouped by their similarity to the target domain, and each group is fed to the training algorithm on a schedule.

Contribution

It introduces a curriculum learning approach to domain adaptation that is simple to implement on top of any neural framework or architecture.

Findings

01

Outperforms both unadapted and adapted baselines

02

Easy to implement on existing neural frameworks

03

Improvements are consistent across two distinct domains and two language pairs

Abstract

We introduce a curriculum learning approach to adapt generic neural machine translation models to a specific domain. Samples are grouped by their similarities to the domain of interest and each group is fed to the training algorithm with a particular schedule. This approach is simple to implement on top of any neural framework or architecture, and consistently outperforms both unadapted and adapted baselines in experiments with two distinct domains and two language pairs.
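
The grouping-and-scheduling idea in the abstract can be illustrated with a minimal sketch. The abstract does not specify how similarity is scored, how many groups are used, or what the schedule looks like, so everything below is an assumption made for illustration: the similarity scores are taken as given, the groups are equal-sized, and the schedule is cumulative (each phase adds the next most similar group). The names `group_by_similarity` and `curriculum_phases` are hypothetical and are not taken from the paper.

```python
import random
from typing import Iterator, List, Sequence, TypeVar

T = TypeVar("T")


def group_by_similarity(
    samples: Sequence[T], scores: Sequence[float], num_groups: int
) -> List[List[T]]:
    """Rank samples by descending similarity score and split them into
    num_groups roughly equal groups (group 0 is closest to the domain).

    Assumption: scores are precomputed by some domain-similarity measure;
    how they are obtained is outside the scope of this sketch.
    """
    if len(samples) != len(scores):
        raise ValueError("samples and scores must have the same length")
    if num_groups < 1:
        raise ValueError("num_groups must be at least 1")

    order = sorted(range(len(samples)), key=lambda i: scores[i], reverse=True)
    ranked = [samples[i] for i in order]

    # Spread any remainder over the first groups so sizes differ by at most 1.
    size, extra = divmod(len(ranked), num_groups)
    groups: List[List[T]] = []
    start = 0
    for g in range(num_groups):
        end = start + size + (1 if g < extra else 0)
        groups.append(ranked[start:end])
        start = end
    return groups


def curriculum_phases(groups: Sequence[Sequence[T]], seed: int = 0) -> Iterator[List[T]]:
    """Yield the training pool for each curriculum phase.

    Assumed schedule: phase k exposes the k most similar groups, shuffled
    together. Other schedules are equally compatible with the abstract.
    """
    rng = random.Random(seed)
    for k in range(1, len(groups) + 1):
        visible = [sample for group in groups[:k] for sample in group]
        rng.shuffle(visible)
        yield visible
```

In a training loop, each pool yielded by `curriculum_phases` would be batched and passed to whatever update step the underlying framework provides, which is why a scheme of this kind needs no change to the model architecture.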

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Software Engineering Research