Learning Kernel-Smoothed Machine Translation with Retrieved Examples

Qingnan Jiang; Mingxuan Wang; Jun Cao; Shanbo Cheng; Shujian Huang; Lei Li

arXiv:2109.09991·cs.CL·December 9, 2021


TL;DR

This paper introduces KSTER, a kernel-smoothed approach for online adaptation of neural machine translation models that improves translation quality without retraining, by effectively leveraging retrieved examples.

Contribution

The paper proposes KSTER, a kernel-smoothed method for online NMT adaptation that, without retraining, outperforms the best existing online adaptation methods.

Findings

01

Achieves improvements of 1.1 to 1.5 BLEU over the best existing online adaptation methods.

02

Effective in domain adaptation and multi-domain translation tasks.

03

Does not require expensive retraining of models.

Abstract

How to effectively adapt neural machine translation (NMT) models according to emerging cases without retraining? Despite the great success of neural machine translation, updating the deployed models online remains a challenge. Existing non-parametric approaches that retrieve similar examples from a database to guide the translation process are promising but are prone to overfitting the retrieved examples. In this work, we propose to learn Kernel-Smoothed Translation with Example Retrieval (KSTER), an effective approach to adapt neural machine translation models online. Experiments on domain adaptation and multi-domain machine translation datasets show that even without expensive retraining, KSTER is able to achieve improvements of 1.1 to 1.5 BLEU over the best existing online adaptation methods. The code and trained models are released at https://github.com/jiangqn/KSTER.
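
The abstract describes the idea only at a high level, so the following is a minimal sketch of what kernel smoothing over retrieved examples could look like, not the authors' implementation. It assumes each retrieved example is a (key vector, target token) pair, weights examples with a Gaussian kernel on squared distance to the query, and interpolates the resulting distribution with the model's next-token distribution. The function name, the `bandwidth` and `mix_weight` parameters, and their fixed values are all hypothetical; the released code at the URL above is the reference for how KSTER actually sets them.

```python
import math


def kernel_smoothed_distribution(query, retrieved, model_probs,
                                 bandwidth=1.0, mix_weight=0.5):
    """Mix a model's next-token distribution with a kernel-weighted
    distribution built from retrieved (key_vector, target_token) pairs.

    All names and default values here are illustrative assumptions.
    """
    # Gaussian kernel weight per retrieved example:
    # exp(-||query - key||^2 / bandwidth)
    weighted = []
    for key, token in retrieved:
        sq_dist = sum((q - k) ** 2 for q, k in zip(query, key))
        weighted.append((token, math.exp(-sq_dist / bandwidth)))

    total = sum(w for _, w in weighted)
    if total == 0.0:
        # Nothing retrieved (or every weight underflowed): fall back
        # to the model's own distribution.
        return dict(model_probs)

    # Normalize kernel weights into an example-based distribution,
    # summing the weights of examples that share a target token.
    example_probs = {}
    for token, w in weighted:
        example_probs[token] = example_probs.get(token, 0.0) + w / total

    # Linear interpolation between the two distributions.
    tokens = set(model_probs) | set(example_probs)
    return {
        t: mix_weight * example_probs.get(t, 0.0)
           + (1.0 - mix_weight) * model_probs.get(t, 0.0)
        for t in tokens
    }


if __name__ == "__main__":
    mixed = kernel_smoothed_distribution(
        query=[0.0, 0.0],
        retrieved=[([0.0, 0.0], "cat"), ([3.0, 4.0], "dog")],
        model_probs={"cat": 0.4, "dog": 0.6},
    )
    print(mixed)
```

In this toy call the retrieved example closest to the query carries almost all of the kernel weight, so the mixed distribution shifts toward its token even though the model alone preferred the other one. A smaller `mix_weight` or a larger `bandwidth` would soften that shift, which is one way to picture the overfitting concern the abstract raises.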
