Modeling Homophone Noise for Robust Neural Machine Translation

Wenjie Qin; Xiang Li; Yuhui Sun; Deyi Xiong; Jianwei Cui; Bin Wang

arXiv:2012.08396·cs.CL·December 16, 2020

TL;DR

This paper introduces a robust neural machine translation framework that pairs a homophone noise detector with a syllable-aware NMT model, making Chinese-to-English translation more resilient to homophone errors.

Contribution

It presents a combined homophone noise detector and syllable-aware NMT model, improving translation accuracy on noisy and clean texts.

Findings

01 Significant gains over baselines on noisy test sets containing homophone noise.

02 Substantial improvement on clean text as well.

03 Potential homophone errors are detected and converted into syllables before translation.

Abstract

In this paper, we propose a robust neural machine translation (NMT) framework. The framework consists of a homophone noise detector and a syllable-aware NMT model that is robust to homophone errors. The detector identifies potential homophone errors in a textual sentence and converts them into syllables, forming a mixed sequence of characters and syllables that is then fed into the syllable-aware NMT model. Extensive experiments on Chinese-to-English translation demonstrate that our proposed method not only significantly outperforms baselines on noisy test sets with homophone noise, but also achieves a substantial improvement on clean text.
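
The mixed-sequence step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the lookup table `TOY_PINYIN`, the function `to_mixed_sequence`, and the hand-supplied set of flagged positions are all hypothetical stand-ins. In the paper, a trained detector decides which positions are suspicious; here those positions are simply passed in, and syllables are written as toneless pinyin, which is an assumption.

```python
from typing import Dict, List, Set

# Toy character-to-pinyin table (illustrative assumption only; a real system
# would rely on a full pronunciation lexicon). Tones are omitted.
TOY_PINYIN: Dict[str, str] = {
    "在": "zai",   # "at / in"
    "再": "zai",   # "again" (homophone of 在)
    "见": "jian",  # "see"
}


def to_mixed_sequence(tokens: List[str], flagged: Set[int]) -> List[str]:
    """Replace tokens at flagged positions with their syllables.

    `flagged` stands in for the output of a homophone noise detector.
    Tokens missing from the toy table are left unchanged.
    """
    mixed: List[str] = []
    for index, token in enumerate(tokens):
        if index in flagged:
            mixed.append(TOY_PINYIN.get(token, token))
        else:
            mixed.append(token)
    return mixed


# Noisy input: "在见" was typed where "再见" ("goodbye") was intended.
noisy_tokens = ["在", "见"]
print(to_mixed_sequence(noisy_tokens, flagged={0}))  # ['zai', '见']
```

Because 在 and 再 share the same syllable, replacing the suspicious character with `zai` removes the misleading surface form and leaves the downstream model to resolve the syllable from context.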

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis