Bemba Speech Translation: Exploring a Low-Resource African Language

Muhammad Hazim Al Farouq; Aman Kassahun Wassie; Yasmin Moslem

arXiv:2505.02518·cs.CL·August 14, 2025

Bemba Speech Translation: Exploring a Low-Resource African Language

Muhammad Hazim Al Farouq, Aman Kassahun Wassie, Yasmin Moslem

PDF

TL;DR

This paper presents a speech translation system for Bemba, a low-resource African language, utilizing cascaded models, data augmentation, and synthetic data to improve translation quality.

Contribution

It introduces a novel approach combining Whisper and NLLB-200 models with data augmentation techniques for Bemba speech translation.

Findings

01

Synthetic data improves translation accuracy

02

Data augmentation enhances low-resource language translation

03

Cascaded systems outperform baseline models

Abstract

This paper describes our system submission to the International Conference on Spoken Language Translation (IWSLT 2025), low-resource languages track, namely for Bemba-to-English speech translation. We built cascaded speech translation systems based on Whisper and NLLB-200, and employed data augmentation techniques, such as back-translation. We investigate the effect of using synthetic data and discuss our experimental setup.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.