Active Learning for Neural Machine Translation
Neeraj Vashistha, Kriti Singh, Ramakant Shakya

TL;DR
This paper explores the use of active learning techniques to improve neural machine translation for low-resource languages, demonstrating faster convergence and higher translation quality with transformer-based models.
Contribution
It introduces active learning methods into NMT training, specifically for low-resource languages, and evaluates their effectiveness using transformer models and BLEU scores.
Findings
Active learning accelerates model convergence.
Active learning improves translation quality.
Transformer-based models benefit from active learning techniques.
Abstract
The machine translation mechanism translates texts automatically between different natural languages, and Neural Machine Translation (NMT) has gained attention for its rational context analysis and fluent translation accuracy. However, processing low-resource languages that lack relevant training attributes like supervised data is a current challenge for Natural Language Processing (NLP). We incorporated a technique known Active Learning with the NMT toolkit Joey NMT to reach sufficient accuracy and robust predictions of low-resource language translation. With active learning, a semi-supervised machine learning strategy, the training algorithm determines which unlabeled data would be the most beneficial for obtaining labels using selected query techniques. We implemented two model-driven acquisition functions for selecting the samples to be validated. This work uses transformer-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Machine Learning and Algorithms · Topic Modeling
