Domain Adaptation and Multi-Domain Adaptation for Neural Machine   Translation: A Survey

Danielle Saunders

arXiv:2104.06951·cs.CL·March 23, 2022·6 cites

Domain Adaptation and Multi-Domain Adaptation for Neural Machine Translation: A Survey

Danielle Saunders

PDF

Open Access

TL;DR

This survey reviews various domain and multi-domain adaptation techniques for Neural Machine Translation, emphasizing methods that improve translation quality across different domains while addressing challenges like overfitting and catastrophic forgetting.

Contribution

It categorizes and analyzes existing adaptation methods, highlighting their benefits and challenges in multi-domain NMT scenarios.

Findings

01

Data selection and generation improve domain adaptation.

02

Model architecture modifications enhance robustness.

03

Multi-domain adaptation benefits other NMT research areas.

Abstract

The development of deep learning techniques has allowed Neural Machine Translation (NMT) models to become extremely powerful, given sufficient training data and training time. However, systems struggle when translating text from a new domain with a distinct style or vocabulary. Fine-tuning on in-domain data allows good domain adaptation, but requires sufficient relevant bilingual data. Even if this is available, simple fine-tuning can cause overfitting to new data and `catastrophic forgetting' of previously learned behaviour. We concentrate on robust approaches to domain adaptation for NMT, particularly where a system may need to translate across multiple domains. We divide techniques into those revolving around data selection or generation, model architecture, parameter adaptation procedure, and inference procedure. We finally highlight the benefits of domain adaptation and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Domain Adaptation and Few-Shot Learning