Large Language Models for Math Education in Low-Resource Languages: A Study in Sinhala and Tamil
Sukumar Kishanthan, Kumar Thushalika, Buddhi Jayasekara, Asela Hevapathige

TL;DR
This study evaluates the mathematical reasoning capabilities of large language models in Sinhala and Tamil, revealing language-dependent performance gaps and emphasizing the importance of language-specific assessments for educational AI deployment.
Contribution
It introduces a bilingual dataset authored by native speakers and systematically assesses LLMs' math reasoning across languages, highlighting the variability in model performance.
Findings
Basic arithmetic reasoning transfers well across Sinhala, Tamil, and English.
Complex reasoning tasks show significant performance degradation in Sinhala and Tamil.
Performance varies by model and problem type, indicating language-specific challenges.
Abstract
Large language models (LLMs) have achieved strong results in mathematical reasoning, and are increasingly deployed as tutoring and learning support tools in educational settings. However, their reliability for students working in non-English languages, especially low-resource languages, remains poorly understood. We examine this gap by evaluating mathematical reasoning in Sinhala and Tamil -- two languages widely used in South Asian schools but underrepresented in artificial intelligence (AI) research. Using a taxonomy of six math problem types, from basic arithmetic to complex unit conflict and optimization problems, we evaluate four prominent large language models. To avoid translation artifacts that confound language ability with translation quality, we construct a parallel dataset in which each problem is independently authored in Sinhala and Tamil by native speakers, and in English…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
