Active Learning for Neural Machine Translation

Neeraj Vashistha; Kriti Singh; Ramakant Shakya

arXiv:2301.00688·cs.CL·January 3, 2023

TL;DR

This paper explores the use of active learning techniques to improve neural machine translation for low-resource languages, demonstrating faster convergence and higher translation quality with transformer-based models.

Contribution

It introduces active learning methods into NMT training, specifically for low-resource languages, and evaluates their effectiveness using transformer models and BLEU scores.

Findings

01. Active learning accelerates model convergence.

02. Active learning improves translation quality.

03. Transformer-based models benefit from active learning techniques.

Abstract

Machine translation automatically translates text between natural languages, and Neural Machine Translation (NMT) has gained attention for its rational context analysis and its fluent, accurate translations. However, handling low-resource languages, which lack training resources such as supervised data, remains a challenge for Natural Language Processing (NLP). We incorporated a technique known as Active Learning into the NMT toolkit Joey NMT to reach sufficient accuracy and robust predictions for low-resource language translation. With active learning, a semi-supervised machine learning strategy, the training algorithm uses selected query techniques to determine which unlabeled data would be most beneficial to label. We implemented two model-driven acquisition functions for selecting the samples to be validated. This work uses transformer-based…
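The selection step the abstract describes can be illustrated with a minimal sketch. The abstract does not name the two acquisition functions, so the least-confidence scoring used here (mean token log-probability of the model's own hypothesis) is an assumption chosen for illustration, not the authors' method. The names `mean_token_log_prob`, `select_for_labeling`, and the toy probabilities are hypothetical, and nothing here calls the Joey NMT API.

```python
import math
from typing import Callable, List, Sequence


def mean_token_log_prob(token_probs: Sequence[float]) -> float:
    """Length-normalised log-probability of a hypothesis.

    Higher values mean the model was more confident in its translation.
    """
    if not token_probs:
        raise ValueError("token_probs must not be empty")
    return sum(math.log(p) for p in token_probs) / len(token_probs)


def select_for_labeling(
    pool: Sequence[str],
    score_fn: Callable[[str], float],
    budget: int,
) -> List[str]:
    """Return the `budget` unlabeled sentences with the lowest confidence score."""
    # Sort ascending by confidence so the least confident sentences come first.
    ranked = sorted(pool, key=score_fn)
    return ranked[:budget]


# Toy stand-in for a model: per-token probabilities of each sentence's
# hypothesis (hypothetical values, not results from the paper).
toy_token_probs = {
    "sentence a": [0.9, 0.8, 0.95],
    "sentence b": [0.4, 0.3, 0.5],
    "sentence c": [0.7, 0.6, 0.65],
}


def toy_score(sentence: str) -> float:
    return mean_token_log_prob(toy_token_probs[sentence])


selected = select_for_labeling(list(toy_token_probs), toy_score, budget=2)
print(selected)
```

In a full active learning loop, the selected sentences would be sent for translation or validation, added to the labeled set, and the model retrained before the next selection round.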

Code & Models

Repositories

kritisingh24/active_learning_nmt · PyTorch · Official

Taxonomy

Topics: Natural Language Processing Techniques · Machine Learning and Algorithms · Topic Modeling