Domain Robustness in Neural Machine Translation

Mathias M\"uller; Annette Rios; Rico Sennrich

arXiv:1911.03109·cs.CL·September 28, 2020·63 cites

Domain Robustness in Neural Machine Translation

Mathias M\"uller, Annette Rios, Rico Sennrich

PDF

Open Access 2 Repos

TL;DR

This paper investigates the domain robustness of SMT and NMT models, highlighting distinct issues like fluency and adequacy, and evaluates methods to improve their performance on out-of-domain translation tasks.

Contribution

It provides a comparative analysis of SMT and NMT domain robustness issues and empirically evaluates methods to enhance out-of-domain translation quality.

Findings

01

NMT suffers from hallucinations causing low adequacy in unknown domains.

02

Several methods improve domain robustness and BLEU scores.

03

Methods only slightly increase translation adequacy despite higher BLEU scores.

Abstract

Translating text that diverges from the training domain is a key challenge for machine translation. Domain robustness---the generalization of models to unseen test domains---is low for both statistical (SMT) and neural machine translation (NMT). In this paper, we study the performance of SMT and NMT models on out-of-domain test sets. We find that in unknown domains, SMT and NMT suffer from very different problems: SMT systems are mostly adequate but not fluent, while NMT systems are mostly fluent, but not adequate. For NMT, we identify such hallucinations (translations that are fluent but unrelated to the source) as a key reason for low domain robustness. To mitigate this problem, we empirically compare methods that are reported to improve adequacy or in-domain robustness in terms of their effectiveness at improving domain robustness. In experiments on German to English OPUS data, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Software Engineering Research

MethodsTest