In-Domain African Languages Translation Using LLMs and Multi-armed Bandits

Pratik Rakesh Singh; Kritarth Prasad; Mohammadi Zaki; Pankaj Wasnik

arXiv:2505.15069·cs.CL·May 22, 2025

In-Domain African Languages Translation Using LLMs and Multi-armed Bandits

Pratik Rakesh Singh, Kritarth Prasad, Mohammadi Zaki, Pankaj Wasnik

PDF

Open Access

TL;DR

This paper explores using bandit algorithms to select the best neural machine translation model for African languages in domain-specific tasks, especially when data is scarce or unavailable, improving translation quality.

Contribution

It introduces a bandit-based model selection framework tailored for low-resource African language translation, addressing domain adaptation without extensive fine-tuning.

Findings

01

Bandit algorithms effectively select optimal models for African language translation.

02

The approach performs well with limited or no target domain data.

03

Results demonstrate robustness across multiple languages and domains.

Abstract

Neural Machine Translation (NMT) systems face significant challenges when working with low-resource languages, particularly in domain adaptation tasks. These difficulties arise due to limited training data and suboptimal model generalization, As a result, selecting an optimal model for translation is crucial for achieving strong performance on in-domain data, particularly in scenarios where fine-tuning is not feasible or practical. In this paper, we investigate strategies for selecting the most suitable NMT model for a given domain using bandit-based algorithms, including Upper Confidence Bound, Linear UCB, Neural Linear Bandit, and Thompson Sampling. Our method effectively addresses the resource constraints by facilitating optimal model selection with high confidence. We evaluate the approach across three African languages and domains, demonstrating its robustness and effectiveness in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Text Readability and Simplification · Topic Modeling