N-Gram Nearest Neighbor Machine Translation
Rui Lv, Junliang Guo, Rui Wang, Xu Tan, Qi Liu, Tao Qin

TL;DR
This paper introduces an n-gram based nearest neighbor retrieval method for machine translation, improving translation accuracy for both autoregressive and non-autoregressive models across general and domain-specific tasks.
Contribution
The authors propose a model-agnostic n-gram retrieval approach that enhances nearest neighbor translation by concatenating adjacent hidden representations, applicable to both AT and NAT models.
Findings
Outperforms token-level retrieval on both AT and NAT models.
Achieves 1.03 BLEU improvement on domain adaptation for AT.
Achieves 2.76 BLEU improvement on domain adaptation for NAT.
Abstract
Nearest neighbor machine translation augments the Autoregressive Translation~(AT) with -nearest-neighbor retrieval, by comparing the similarity between the token-level context representations of the target tokens in the query and the datastore. However, the token-level representation may introduce noise when translating ambiguous words, or fail to provide accurate retrieval results when the representation generated by the model contains indistinguishable context information, e.g., Non-Autoregressive Translation~(NAT) models. In this paper, we propose a novel -gram nearest neighbor retrieval method that is model agnostic and applicable to both AT and NAT models. Specifically, we concatenate the adjacent -gram hidden representations as the key, while the tuple of corresponding target tokens is the value. In inference, we propose tailored decoding algorithms for AT and NAT models…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text and Document Classification Technologies
Methodsfail
