Integrating Weakly Supervised Word Sense Disambiguation into Neural Machine Translation
Xiao Pu, Nikolaos Pappas, James Henderson, Andrei Popescu-Belis

TL;DR
This paper shows that integrating weakly supervised word sense disambiguation (WSD) into neural machine translation (NMT) improves translation quality: senses are modeled from a wider source context and fed to the NMT system as sense vectors, giving gains above one BLEU point on five language pairs.
Contribution
The paper introduces three adaptive clustering algorithms for WSD, based on k-means, Chinese restaurant processes, and random walks, and integrates the sense vectors defined by the best of them into NMT, which improves translation accuracy for ambiguous words.
Findings
Over one BLEU point improvement on five language pairs
+4% accuracy on ambiguous nouns and verbs
+20% manual scoring improvement on challenging words
Abstract
This paper demonstrates that word sense disambiguation (WSD) can improve neural machine translation (NMT) by widening the source context considered when modeling the senses of potentially ambiguous words. We first introduce three adaptive clustering algorithms for WSD, based on k-means, Chinese restaurant processes, and random walks, which are then applied to large word contexts represented in a low-rank space and evaluated on SemEval shared-task data. We then learn word vectors jointly with sense vectors defined by our best WSD method, within a state-of-the-art NMT system. We show that the concatenation of these vectors, and the use of a sense selection mechanism based on the weighted average of sense vectors, outperforms several baselines including sense-aware ones. This is demonstrated by translation on five language pairs. The improvements are above one BLEU point over strong NMT…
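The sense selection step mentioned in the abstract (a weighted average of sense vectors, concatenated with the word vector) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function name `sense_aware_embedding`, the dot-product scoring against a context vector, and the softmax weighting are all hypothetical choices made here for clarity.

```python
import math


def softmax(scores):
    """Turn raw scores into weights that sum to 1 (numerically stable)."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def sense_aware_embedding(word_vec, sense_vecs, context_vec):
    """Concatenate a word vector with a context-weighted average of its senses.

    Assumption: each sense is scored by its dot product with a context
    vector, and the scores are normalized with a softmax. The paper may
    compute the weights differently.
    """
    weights = softmax([dot(s, context_vec) for s in sense_vecs])
    dim = len(sense_vecs[0])
    avg_sense = [
        sum(w * s[i] for w, s in zip(weights, sense_vecs)) for i in range(dim)
    ]
    # List concatenation: the result has len(word_vec) + dim entries.
    return word_vec + avg_sense, weights


# Toy example: a word with two senses and a context that favors the first.
word_vec = [0.5, -0.2]
sense_vecs = [[1.0, 0.0], [0.0, 1.0]]
context_vec = [2.0, 0.0]
embedding, weights = sense_aware_embedding(word_vec, sense_vecs, context_vec)
```

In this toy setup the first sense receives the larger weight because it aligns with the context vector, and the resulting embedding keeps the original word vector in its first half. Using a soft weighted average rather than a hard choice of one sense lets an uncertain disambiguation contribute partial evidence instead of committing to a possibly wrong sense.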
Taxonomy
Topics: Natural Language Processing Techniques · Topic Modeling · Speech and dialogue systems
