Beyond MLE: Investigating SEARNN for Low-Resourced Neural Machine   Translation

Chris Emezue

arXiv:2405.11819·cs.CL·May 21, 2024

Beyond MLE: Investigating SEARNN for Low-Resourced Neural Machine Translation

Chris Emezue

PDF

Open Access

TL;DR

This paper investigates SEARNN, an alternative training method to MLE, for low-resource neural machine translation, demonstrating a 5.4% BLEU score improvement on African language translation tasks.

Contribution

It is the first to evaluate SEARNN for low-resource NMT, showing its effectiveness over MLE in challenging language scenarios with limited data.

Findings

01

SEARNN outperforms MLE with a 5.4% BLEU score increase.

02

SEARNN effectively handles morphological complexity in low-resource languages.

03

The approach improves translation quality in African language pairs.

Abstract

Structured prediction tasks, like machine translation, involve learning functions that map structured inputs to structured outputs. Recurrent Neural Networks (RNNs) have historically been a popular choice for such tasks, including in natural language processing (NLP) applications. However, training RNNs using Maximum Likelihood Estimation (MLE) has its limitations, including exposure bias and a mismatch between training and testing metrics. SEARNN, based on the learning to search (L2S) framework, has been proposed as an alternative to MLE for RNN training. This project explored the potential of SEARNN to improve machine translation for low-resourced African languages -- a challenging task characterized by limited training data availability and the morphological complexity of the languages. Through experiments conducted on translation for English to Igbo, French to \ewe, and French to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques