Finding the Right Recipe for Low Resource Domain Adaptation in Neural Machine Translation
Virginia Adams, Sandeep Subramanian, Mike Chrzanowski, Oleksii, Hrinchuk, and Oleksii Kuchaiev

TL;DR
This paper empirically evaluates domain adaptation methods for neural machine translation in low-resource scenarios across multiple domains and languages, providing practical recommendations and datasets.
Contribution
It systematically compares data-centric adaptation techniques in very low and moderately low resource settings, proposing an ensemble approach and releasing relevant datasets and code.
Findings
Ensemble methods improve translation quality in low-resource domains.
Data-centric approaches vary in effectiveness across domains and resource levels.
Practical recommendations for domain adaptation in NMT are provided.
Abstract
General translation models often still struggle to generate accurate translations in specialized domains. To guide machine translation practitioners and characterize the effectiveness of domain adaptation methods under different data availability scenarios, we conduct an in-depth empirical exploration of monolingual and parallel data approaches to domain adaptation of pre-trained, third-party, NMT models in settings where architecture change is impractical. We compare data centric adaptation methods in isolation and combination. We study method effectiveness in very low resource (8k parallel examples) and moderately low resource (46k parallel examples) conditions and propose an ensemble approach to alleviate reductions in original domain translation quality. Our work includes three domains: consumer electronic, clinical, and biomedical and spans four language pairs - Zh-En, Ja-En,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification
