An Empirical Comparison of Simple Domain Adaptation Methods for Neural   Machine Translation

Chenhui Chu; Raj Dabre; and Sadao Kurohashi

arXiv:1701.03214·cs.CL·February 7, 2017·51 cites

An Empirical Comparison of Simple Domain Adaptation Methods for Neural Machine Translation

Chenhui Chu, Raj Dabre, and Sadao Kurohashi

PDF

Open Access

TL;DR

This paper introduces a new domain adaptation technique for neural machine translation called 'mixed fine tuning', which combines fine tuning and multi-domain training with artificial tags, and compares it empirically to existing methods.

Contribution

The paper presents a novel domain adaptation method for NMT that integrates fine tuning and multi-domain training with artificial tags, and provides an empirical comparison with existing approaches.

Findings

01

Mixed fine tuning improves translation quality over baseline methods.

02

Artificial tags effectively indicate domain-specific information.

03

The method has certain limitations in specific domain scenarios.

Abstract

In this paper, we propose a novel domain adaptation method named "mixed fine tuning" for neural machine translation (NMT). We combine two existing approaches namely fine tuning and multi domain NMT. We first train an NMT model on an out-of-domain parallel corpus, and then fine tune it on a parallel corpus which is a mix of the in-domain and out-of-domain corpora. All corpora are augmented with artificial tags to indicate specific domains. We empirically compare our proposed method against fine tuning and multi domain methods and discuss its benefits and shortcomings.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications