Beyond MLE: Investigating SEARNN for Low-Resourced Neural Machine Translation
Chris Emezue

TL;DR
This paper investigates SEARNN, an alternative training method to MLE, for low-resource neural machine translation, demonstrating a 5.4% BLEU score improvement on African language translation tasks.
Contribution
It is the first to evaluate SEARNN for low-resource NMT, showing its effectiveness over MLE in challenging language scenarios with limited data.
Findings
SEARNN outperforms MLE with a 5.4% BLEU score increase.
SEARNN effectively handles morphological complexity in low-resource languages.
The approach improves translation quality in African language pairs.
Abstract
Structured prediction tasks, like machine translation, involve learning functions that map structured inputs to structured outputs. Recurrent Neural Networks (RNNs) have historically been a popular choice for such tasks, including in natural language processing (NLP) applications. However, training RNNs using Maximum Likelihood Estimation (MLE) has its limitations, including exposure bias and a mismatch between training and testing metrics. SEARNN, based on the learning to search (L2S) framework, has been proposed as an alternative to MLE for RNN training. This project explored the potential of SEARNN to improve machine translation for low-resourced African languages -- a challenging task characterized by limited training data availability and the morphological complexity of the languages. Through experiments conducted on translation for English to Igbo, French to \ewe, and French to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques
