HinTel-AlignBench: A Framework and Benchmark for Hindi-Telugu with English-Aligned Samples

Rishikant Chigrupaatii; Ponnada Sai Tulasi Kanishka; Lalit Chandra Routhu; Martin Patel Sama Supratheek Reddy; Divyam Gupta; Dasari Srikar; Krishna Teja Kuchimanchi; Rajiv Misra; Rohun Tripathi

arXiv:2511.15183·cs.CL·November 20, 2025

HinTel-AlignBench: A Framework and Benchmark for Hindi-Telugu with English-Aligned Samples

Rishikant Chigrupaatii, Ponnada Sai Tulasi Kanishka, Lalit Chandra Routhu, Martin Patel Sama Supratheek Reddy, Divyam Gupta, Dasari Srikar, Krishna Teja Kuchimanchi, Rajiv Misra, Rohun Tripathi

PDF

Open Access 1 Datasets

TL;DR

This paper introduces HinTel-AlignBench, a comprehensive benchmark and framework for evaluating vision-language models on Hindi and Telugu, addressing evaluation gaps and analyzing model performance across diverse Indian language datasets.

Contribution

It presents a semi-automated dataset creation framework, the HinTel-AlignBench benchmark with diverse datasets, and a detailed analysis of SOTA model performance on Indian languages.

Findings

01

Models perform worse on Indian languages compared to English.

02

Average performance regression is 8.3 points in Hindi and 5.5 in Telugu.

03

Identifies common failure modes to guide future improvements.

Abstract

With nearly 1.5 billion people and more than 120 major languages, India represents one of the most diverse regions in the world. As multilingual Vision-Language Models (VLMs) gain prominence, robust evaluation methodologies are essential to drive progress toward equitable AI for low-resource languages. Current multilingual VLM evaluations suffer from four major limitations: reliance on unverified auto-translations, narrow task/domain coverage, limited sample sizes, and lack of cultural and natively sourced Question-Answering (QA). To address these gaps, we present a scalable framework to evaluate VLMs in Indian languages and compare it with performance in English. Using the framework, we generate HinTel-AlignBench, a benchmark that draws from diverse sources in Hindi and Telugu with English-aligned samples. Our contributions are threefold: (1) a semi-automated dataset creation framework…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

lalit-03/IndicVisionBench
dataset· 73 dl
73 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Topic Modeling