Unsupervised Domain Adaptation for Neural Machine Translation with   Domain-Aware Feature Embeddings

Zi-Yi Dou; Junjie Hu; Antonios Anastasopoulos; Graham Neubig

arXiv:1908.10430·cs.CL·August 29, 2019

Unsupervised Domain Adaptation for Neural Machine Translation with Domain-Aware Feature Embeddings

Zi-Yi Dou, Junjie Hu, Antonios Anastasopoulos, Graham Neubig

PDF

1 Repo

TL;DR

This paper introduces a novel unsupervised domain adaptation method for neural machine translation that uses domain-aware feature embeddings, improving translation quality across different domains, especially when domain-specific data is limited.

Contribution

The paper proposes a new approach using domain-aware feature embeddings learned via an auxiliary language modeling task for better domain adaptation in NMT.

Findings

01

Consistent improvements in translation quality across multiple settings.

02

Combining domain-aware embeddings with back translation yields further gains.

03

Effective control of domain-specific outputs in neural machine translation.

Abstract

The recent success of neural machine translation models relies on the availability of high quality, in-domain data. Domain adaptation is required when domain-specific data is scarce or nonexistent. Previous unsupervised domain adaptation strategies include training the model with in-domain copied monolingual or back-translated data. However, these methods use generic representations for text regardless of domain shift, which makes it infeasible for translation models to control outputs conditional on a specific domain. In this work, we propose an approach that adapts models with domain-aware feature embeddings, which are learned via an auxiliary language modeling task. Our approach allows the model to assign domain-specific representations to words and output sentences in the desired domain. Our empirical results demonstrate the effectiveness of the proposed strategy, achieving…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zdou0830/DAFE
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.