IndicParam: Benchmark to evaluate LLMs on low-resource Indic Languages

Ayush Maheshwari; Kaushal Sharma; Vivek Patel; Aditya Maheshwari

arXiv:2512.00333·cs.CL·January 13, 2026

TL;DR

IndicParam is a benchmark of over 13,000 questions across 11 low-resource Indic languages, used to evaluate 20 large language models. It reveals significant performance gaps and probes how well models handle diverse question formats.

Contribution

The paper introduces IndicParam, a new benchmark dataset for low-resource Indic languages, including diverse question formats and linguistic annotations, to evaluate LLMs' multilingual and cross-lingual abilities.

Findings

01

Top model achieves 58% accuracy on average

02

LLMs struggle with low-resource Indic languages

03

Benchmark reveals limitations in cross-lingual transfer

Abstract

While large language models excel on high-resource multilingual tasks, low- and extremely low-resource Indic languages remain severely under-evaluated. We present IndicParam, a human-curated benchmark of over 13,000 multiple-choice questions covering 11 such languages (Nepali, Gujarati, Marathi, Odia as low-resource; Dogri, Maithili, Rajasthani, Sanskrit, Bodo, Santali, Konkani as extremely low-resource) plus a Sanskrit-English code-mixed set. We evaluate 20 LLMs, both proprietary and open-weights, and find that even the top-performing Gemini-2.5 reaches 58% average accuracy, followed by GPT-5 (45) and DeepSeek-3.2 (43.1). We additionally label each question as knowledge-oriented or purely linguistic to discriminate factual recall from grammatical proficiency. Further, we assess the ability of LLMs to handle diverse question formats, such as list-based…
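As a rough illustration of how average accuracy on a multiple-choice benchmark like this might be computed, the sketch below scores predictions per language and per question label, then averages across languages. The record fields (`language`, `category`, `answer`, `prediction`), the toy data, and the choice of an unweighted mean over languages are all assumptions made for illustration; the abstract does not specify the dataset schema or the exact averaging scheme.

```python
from collections import defaultdict


def accuracy_by_group(records, key):
    """Return {group: accuracy}, where accuracy = correct / total per group."""
    correct = defaultdict(int)
    total = defaultdict(int)
    for r in records:
        total[r[key]] += 1
        if r["prediction"] == r["answer"]:
            correct[r[key]] += 1
    return {group: correct[group] / total[group] for group in total}


def macro_average(per_group):
    """Unweighted mean of the per-group accuracies (an assumed averaging scheme)."""
    return sum(per_group.values()) / len(per_group)


# Hypothetical toy records; field names are illustrative, not the real schema.
records = [
    {"language": "Nepali", "category": "knowledge", "answer": "A", "prediction": "A"},
    {"language": "Nepali", "category": "linguistic", "answer": "C", "prediction": "B"},
    {"language": "Bodo", "category": "knowledge", "answer": "D", "prediction": "D"},
    {"language": "Bodo", "category": "linguistic", "answer": "B", "prediction": "B"},
]

per_language = accuracy_by_group(records, "language")
per_category = accuracy_by_group(records, "category")
average = macro_average(per_language)

print(per_language)
print(per_category)
print(f"average accuracy: {average:.1%}")
```

Grouping by `category` instead of `language` gives the knowledge-oriented versus purely linguistic split described above, using the same function.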

Code & Models

Datasets

bharatgenai/IndicParam
dataset · 82 dl

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Text Readability and Simplification