Simply Trainable Nearest Neighbour Machine Translation with GPU Inference
Hossam Amer, Abdelrahman Abouelenin, Mohamed Maher, Evram Narouz,, Mohamed Afify, and Hany Awadallah

TL;DR
This paper introduces a GPU-efficient, trainable nearest neighbor machine translation method that adaptively constructs small datastores and learns interpolation coefficients, improving translation quality across domains with minimal speed loss.
Contribution
It proposes a simple, trainable interpolation approach for kNN-based machine translation that is automatic and GPU-compatible, enhancing domain adaptation without retraining the entire model.
Findings
Improves or maintains translation quality across domains.
Achieves only 5% speed reduction on GPU inference.
Automatically learns interpolation coefficients for better domain adaptation.
Abstract
Nearest neighbor machine translation is a successful approach for fast domain adaption, which interpolates the pre-trained transformers with domain-specific token-level k-nearest-neighbor (kNN) retrieval without retraining. Despite kNN MT's success, searching large reference corpus and fixed interpolation between the kNN and pre-trained model led to computational complexity and translation quality challenges. Among other papers, Dai et al. proposed methods to obtain a small number of reference samples dynamically for which they introduced a distance-aware interpolation method using an equation that includes free parameters. This paper proposes a simply trainable nearest neighbor machine translation and carry out inference experiments on GPU. Similar to Dai et al., we first adaptively construct a small datastore for each input sentence. Second, we train a single-layer network for the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text and Document Classification Technologies
