MATA: Mindful Assessment of the Telugu Abilities of Large Language Models

Chalamalasetti Kranti; Sowmya Vajjala

arXiv:2508.13526·cs.CL·March 19, 2026

MATA: Mindful Assessment of the Telugu Abilities of Large Language Models

Chalamalasetti Kranti, Sowmya Vajjala

PDF

Open Access 1 Datasets

TL;DR

This paper introduces MATA, a comprehensive Telugu language evaluation dataset for LLMs, revealing their reliance on superficial heuristics and comparing model judgments with human assessments to improve linguistic capabilities.

Contribution

The paper presents MATA, a new Telugu language dataset for LLM evaluation, and analyzes model performance, heuristics reliance, and evaluation reliability in low-resource settings.

Findings

01

LLMs rely on superficial heuristics like answer position.

02

Performance varies significantly across models.

03

Model evaluation correlates with human judgment in open-ended tasks.

Abstract

In this paper, we introduce MATA, a novel evaluation dataset to assess the ability of Large Language Models (LLMs) in Telugu language, comprising 729 carefully curated multiple-choice and open-ended questions that span diverse linguistic dimensions. We evaluate 11 open-weight and closed-source LLMs on our dataset and present a fine-grained analysis of their performance. Further, we empirically show how LLMs rely on superficial heuristics such as answer position and distractor patterns for multiple-choice questions. Finally, we also compare LLM-as-a-judge evaluation with human evaluation for open-ended questions assess its reliability in a low-resource language. We argue that such fine-grained evaluation is essential for understanding model limitations and can inform the development of more linguistically capable LLMs, while also serving as a foundation for future research in Telugu NLP.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

TeluguLLMResearch/MATA
dataset· 9 dl
9 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEducational and Psychological Assessments