Improving Multilingual Math Reasoning for African Languages

Odunayo Ogundepo; Akintunde Oladipo; Kelechi Ogueji; Esther Adenuga; David Ifeoluwa Adelani; Jimmy Lin

arXiv:2505.19848·cs.CL·May 27, 2025

Improving Multilingual Math Reasoning for African Languages

Odunayo Ogundepo, Akintunde Oladipo, Kelechi Ogueji, Esther Adenuga, David Ifeoluwa Adelani, Jimmy Lin

PDF

Open Access

TL;DR

This paper systematically evaluates various adaptation strategies for extending large language models to African languages, focusing on mathematical reasoning tasks, to identify the most effective methods for low-resource language settings.

Contribution

It provides a comprehensive analysis of data types, training stages, and adaptation configurations for improving LLM performance on African languages.

Findings

01

Pre-training with translated data improves reasoning accuracy.

02

Post-training strategies enhance low-resource language adaptation.

03

Optimal combination of data and training stages yields best results.

Abstract

Researchers working on low-resource languages face persistent challenges due to limited data availability and restricted access to computational resources. Although most large language models (LLMs) are predominantly trained in high-resource languages, adapting them to low-resource contexts, particularly African languages, requires specialized techniques. Several strategies have emerged for adapting models to low-resource languages in todays LLM landscape, defined by multi-stage pre-training and post-training paradigms. However, the most effective approaches remain uncertain. This work systematically investigates which adaptation strategies yield the best performance when extending existing LLMs to African languages. We conduct extensive experiments and ablation studies to evaluate different combinations of data types (translated versus synthetically generated), training stages…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMathematics Education and Teaching Techniques

MethodsLLaMA · Balanced Selection