Finding the Right Recipe for Low Resource Domain Adaptation in Neural   Machine Translation

Virginia Adams; Sandeep Subramanian; Mike Chrzanowski; Oleksii; Hrinchuk; and Oleksii Kuchaiev

arXiv:2206.01137·cs.CL·June 3, 2022

Finding the Right Recipe for Low Resource Domain Adaptation in Neural Machine Translation

Virginia Adams, Sandeep Subramanian, Mike Chrzanowski, Oleksii, Hrinchuk, and Oleksii Kuchaiev

PDF

Open Access

TL;DR

This paper empirically evaluates domain adaptation methods for neural machine translation in low-resource scenarios across multiple domains and languages, providing practical recommendations and datasets.

Contribution

It systematically compares data-centric adaptation techniques in very low and moderately low resource settings, proposing an ensemble approach and releasing relevant datasets and code.

Findings

01

Ensemble methods improve translation quality in low-resource domains.

02

Data-centric approaches vary in effectiveness across domains and resource levels.

03

Practical recommendations for domain adaptation in NMT are provided.

Abstract

General translation models often still struggle to generate accurate translations in specialized domains. To guide machine translation practitioners and characterize the effectiveness of domain adaptation methods under different data availability scenarios, we conduct an in-depth empirical exploration of monolingual and parallel data approaches to domain adaptation of pre-trained, third-party, NMT models in settings where architecture change is impractical. We compare data centric adaptation methods in isolation and combination. We study method effectiveness in very low resource (8k parallel examples) and moderately low resource (46k parallel examples) conditions and propose an ensemble approach to alleviate reductions in original domain translation quality. Our work includes three domains: consumer electronic, clinical, and biomedical and spans four language pairs - Zh-En, Ja-En,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification