Improving Neural Machine Translation Robustness via Data Augmentation: Beyond Back Translation
Zhenhao Li, Lucia Specia

TL;DR
This paper introduces novel data augmentation techniques and leverages external noisy data sources to enhance neural machine translation robustness against input noise, surpassing reliance on back translation alone.
Contribution
It proposes new data augmentation methods to improve NMT robustness with limited data and explores using external noise sources like speech transcripts.
Findings
Enhanced NMT robustness with proposed augmentation methods.
Utilizing external noise data improves translation resilience.
Models maintain small size while gaining robustness.
Abstract
Neural Machine Translation (NMT) models have been proved strong when translating clean texts, but they are very sensitive to noise in the input. Improving NMT models robustness can be seen as a form of "domain" adaption to noise. The recently created Machine Translation on Noisy Text task corpus provides noisy-clean parallel data for a few language pairs, but this data is very limited in size and diversity. The state-of-the-art approaches are heavily dependent on large volumes of back-translated data. This paper has two main contributions: Firstly, we propose new data augmentation methods to extend limited noisy data and further improve NMT robustness to noise while keeping the models small. Secondly, we explore the effect of utilizing noise from external data in the form of speech transcripts and show that it could help robustness.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
