Non-Parametric Online Learning from Human Feedback for Neural Machine Translation
Dongqi Wang, Haoran Wei, Zhirui Zhang, Shujian Huang, Jun Xie, Jiajun, Chen

TL;DR
This paper introduces a non-parametric online learning approach for neural machine translation that leverages human feedback without altering the model structure, improving translation accuracy and adaptation efficiency.
Contribution
It presents a novel k-nearest-neighbor based method that memorizes human feedback and adaptively balances it with the original model, avoiding complex online updates.
Findings
Significant improvements in translation accuracy on benchmarks
Enhanced adaptation with fewer human corrections
No need for online model updates or additional networks
Abstract
We study the problem of online learning with human feedback in the human-in-the-loop machine translation, in which the human translators revise the machine-generated translations and then the corrected translations are used to improve the neural machine translation (NMT) system. However, previous methods require online model updating or additional translation memory networks to achieve high-quality performance, making them inflexible and inefficient in practice. In this paper, we propose a novel non-parametric online learning method without changing the model structure. This approach introduces two k-nearest-neighbor (knn) modules: one module memorizes the human feedback, which is the correct sentences provided by human translators, while the other balances the usage of the history human feedback and original NMT models adaptively. Experiments conducted on EMEA and JRC-Acquis benchmarks…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsNeural Networks and Applications · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsEntropy Minimized Ensemble of Adapters
