Incremental Adaptation Strategies for Neural Network Language Models

Aram Ter-Sarkisov; Holger Schwenk; Loic Barrault; Fethi Bougares

arXiv:1412.6650·cs.NE·July 8, 2015

TL;DR

This paper introduces efficient incremental adaptation techniques for neural network language models, enabling rapid updates with small data sets without overfitting, thus improving translation quality.

Contribution

It proposes two methods for fast adaptation of a neural network language model: continued training on resampled data and insertion of adaptation layers.

Findings

01

Both methods are computationally efficient and fast.

02

They significantly improve translation quality.

03

They prevent overfitting on small adaptation datasets.

Abstract

It is today acknowledged that neural network language models outperform backoff language models in applications like speech recognition or statistical machine translation. However, training these models on large amounts of data can take several days. We present efficient techniques to adapt a neural network language model to new data. Instead of training a completely new model or relying on mixture approaches, we propose two new methods: continued training on resampled data or insertion of adaptation layers. We present experimental results in a computer-assisted translation (CAT) environment where the post-edits of professional translators are used to improve an SMT system. Both methods are very fast and achieve significant improvements without overfitting the small adaptation data.
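The two methods named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the `n_background` parameter, the identity initialization of the inserted layer, and the choice of a purely linear layer are all assumptions made for illustration, since the abstract does not specify these details.

```python
import random


def resample_for_continued_training(background, adaptation, n_background, seed=0):
    """Build a small corpus for continued training (hypothetical helper).

    Assumption: all adaptation sentences are kept and mixed with a random
    sample of the original (background) training data, so that continued
    training does not see the small adaptation set in isolation.
    """
    rng = random.Random(seed)
    sampled = rng.sample(background, n_background)
    mixed = list(adaptation) + sampled
    rng.shuffle(mixed)
    return mixed


def identity_layer(size):
    """Weights for a new square layer, initialized to the identity matrix.

    Assumption: starting from identity means the inserted layer initially
    passes hidden activations through unchanged, so only training on the
    adaptation data moves the model away from its original behavior.
    """
    return [[1.0 if i == j else 0.0 for j in range(size)] for i in range(size)]


def apply_linear(weights, vector):
    """Multiply a weight matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in weights]


# Usage: mix 10 background sentences with two post-edited sentences.
background = [f"background sentence {i}" for i in range(100)]
adaptation = ["post-edited sentence a", "post-edited sentence b"]
mixed = resample_for_continued_training(background, adaptation, n_background=10)

# Usage: a freshly inserted adaptation layer leaves activations unchanged.
hidden = [0.5, -1.0, 2.0]
adapted = apply_linear(identity_layer(len(hidden)), hidden)
```

In a full system, the mixed corpus would feed a few additional training epochs of the existing model, while the inserted layer would be trained on the adaptation data, possibly with the original weights kept fixed. Both of those training choices are likewise assumptions here.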
