Addressing the Rare Word Problem in Neural Machine Translation
Minh-Thang Luong, Ilya Sutskever, Quoc V. Le, Oriol Vinyals, Wojciech, Zaremba

TL;DR
This paper presents a novel method to improve neural machine translation by effectively handling rare and out-of-vocabulary words through alignment-based post-processing, significantly boosting translation quality.
Contribution
It introduces an alignment-based technique that allows NMT systems to better translate rare words by leveraging source-target alignments and dictionary lookups.
Findings
Achieved up to 2.8 BLEU point improvement over baseline NMT.
First NMT system to surpass the best WMT14 result with 37.5 BLEU.
Effective handling of OOV words enhances translation accuracy.
Abstract
Neural Machine Translation (NMT) is a new approach to machine translation that has shown promising results that are comparable to traditional approaches. A significant weakness in conventional NMT systems is their inability to correctly translate very rare words: end-to-end NMTs tend to have relatively small vocabularies with a single unk symbol that represents every possible out-of-vocabulary (OOV) word. In this paper, we propose and implement an effective technique to address this problem. We train an NMT system on data that is augmented by the output of a word alignment algorithm, allowing the NMT system to emit, for each OOV word in the target sentence, the position of its corresponding word in the source sentence. This information is later utilized in a post-processing step that translates every OOV word using a dictionary. Our experiments on the WMT14 English to French translation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications
