Non-Parametric Unsupervised Domain Adaptation for Neural Machine Translation
Xin Zheng, Zhirui Zhang, Shujian Huang, Boxing Chen, Jun Xie, Weihua, Luo, Jiajun Chen

TL;DR
This paper introduces a novel unsupervised domain adaptation method for neural machine translation that leverages target-side monolingual data with a $k$NN retrieval framework, eliminating the need for parallel corpora.
Contribution
It proposes a new framework using target monolingual sentences and an autoencoder-based approach with adapters to enhance NMT domain adaptation without parallel data.
Findings
Significantly improves translation accuracy with target monolingual data
Achieves comparable performance to back-translation methods
Effective for multi-domain NMT adaptation
Abstract
Recently, NN-MT has shown the promising capability of directly incorporating the pre-trained neural machine translation (NMT) model with domain-specific token-level -nearest-neighbor (NN) retrieval to achieve domain adaptation without retraining. Despite being conceptually attractive, it heavily relies on high-quality in-domain parallel corpora, limiting its capability on unsupervised domain adaptation, where in-domain parallel corpora are scarce or nonexistent. In this paper, we propose a novel framework that directly uses in-domain monolingual sentences in the target language to construct an effective datastore for -nearest-neighbor retrieval. To this end, we first introduce an autoencoder task based on the target language, and then insert lightweight adapters into the original NMT model to map the token-level representation of this task to the ideal representation of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications
