Efficient k-Nearest-Neighbor Machine Translation with Dynamic Retrieval
Yan Gao, Zhiwei Cao, Zhongjian Miao, Baosong Yang, Shiyu Liu, Min, Zhang, Jinsong Su

TL;DR
This paper introduces a dynamic retrieval method for k-Nearest-Neighbor Machine Translation that improves efficiency by intelligently deciding when to perform retrieval, addressing limitations of previous adaptive approaches.
Contribution
It proposes a novel kNN-MT with dynamic retrieval using a classifier and timestep-aware threshold, enhancing efficiency and accuracy over existing methods.
Findings
Significantly reduces retrieval time without sacrificing translation quality.
Outperforms previous adaptive kNN-MT methods on standard datasets.
Demonstrates generality across different translation tasks.
Abstract
To achieve non-parametric NMT domain adaptation, -Nearest-Neighbor Machine Translation (NN-MT) constructs an external datastore to store domain-specific translation knowledge, which derives a NN distribution to interpolate the prediction distribution of the NMT model via a linear interpolation coefficient . Despite its success, NN retrieval at each timestep leads to substantial time overhead. To address this issue, dominant studies resort to NN-MT with adaptive retrieval (NN-MT-AR), which dynamically estimates and skips NN retrieval if is less than a fixed threshold. Unfortunately, NN-MT-AR does not yield satisfactory results. In this paper, we first conduct a preliminary study to reveal two key limitations of NN-MT-AR: 1) the optimization gap leads to inaccurate estimation of for determining NN retrieval skipping, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Text and Document Classification Technologies · Algorithms and Data Compression
