Multilingual Question Answering in Low-Resource Settings: A Dzongkha-English Benchmark for Foundation Models

Md. Tanzib Hosain; Rajan Das Gupta; Md. Kishor Morol

arXiv:2505.18638·cs.CL·May 30, 2025

Multilingual Question Answering in Low-Resource Settings: A Dzongkha-English Benchmark for Foundation Models

Md. Tanzib Hosain, Rajan Das Gupta, Md. Kishor Morol

PDF

TL;DR

This paper introduces DZEN, a Dzongkha-English dataset for scientific questions, and evaluates large language models' performance in low-resource language question answering, highlighting challenges and strategies for improvement.

Contribution

The creation of a parallel Dzongkha-English question dataset and an analysis of LLM performance and prompting strategies in a low-resource language context.

Findings

01

LLMs perform significantly worse in Dzongkha than in English.

02

Chain-of-Thought prompting improves reasoning question accuracy.

03

Adding English translations enhances Dzongkha question response precision.

Abstract

In this work, we provide DZEN, a dataset of parallel Dzongkha and English test questions for Bhutanese middle and high school students. The over 5K questions in our collection span a variety of scientific topics and include factual, application, and reasoning-based questions. We use our parallel dataset to test a number of Large Language Models (LLMs) and find a significant performance difference between the models in English and Dzongkha. We also look at different prompting strategies and discover that Chain-of-Thought (CoT) prompting works well for reasoning questions but less well for factual ones. We also find that adding English translations enhances the precision of Dzongkha question responses. Our results point to exciting avenues for further study to improve LLM performance in Dzongkha and, more generally, in low-resource languages. We release the dataset at:…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.