Dynamic Data Selection for Neural Machine Translation

Marlies van der Wees; Arianna Bisazza; Christof Monz

arXiv:1708.00712·cs.CL·August 3, 2017

Dynamic Data Selection for Neural Machine Translation

Marlies van der Wees, Arianna Bisazza, Christof Monz

PDF

1 Repo

TL;DR

This paper investigates the effectiveness of data selection techniques for neural machine translation and introduces a dynamic data selection method called gradual fine-tuning, which improves translation quality.

Contribution

The paper demonstrates that dynamic data selection, specifically gradual fine-tuning, enhances NMT performance beyond traditional static data selection methods.

Findings

01

Gradual fine-tuning improves BLEU scores by up to +3.1 over baseline.

02

Data selection benefits are less pronounced in NMT compared to PBMT.

03

Dynamic data selection outperforms static approaches in NMT training.

Abstract

Intelligent selection of training data has proven a successful technique to simultaneously increase training efficiency and translation performance for phrase-based machine translation (PBMT). With the recent increase in popularity of neural machine translation (NMT), we explore in this paper to what extent and how NMT can also benefit from data selection. While state-of-the-art data selection (Axelrod et al., 2011) consistently performs well for PBMT, we show that gains are substantially lower for NMT. Next, we introduce dynamic data selection for NMT, a method in which we vary the selected subset of training data between different training epochs. Our experiments show that the best results are achieved when applying a technique we call gradual fine-tuning, with improvements up to +2.6 BLEU over the original data selection approach and up to +3.1 BLEU over a general baseline.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

marliesvanderwees/dds-nmt
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.