One Instruction Does Not Fit All: How Well Do Embeddings Align Personas and Instructions in Low-Resource Indian Languages?

Arya Shah; Himanshu Beniwal; Mayank Singh

arXiv:2601.10205·cs.CL·January 16, 2026

TL;DR

This paper introduces a unified benchmark for evaluating multilingual embedding models on persona-instruction alignment in 12 Indian languages, covering retrieval and classification tasks, to guide model selection and future research.

Contribution

It presents a unified benchmark covering 12 Indian languages and four evaluation tasks, and provides baseline results for eight multilingual embedding models evaluated as frozen encoders.

Findings

01. E5-Large-Instruct achieves the highest monolingual retrieval Recall@1 (27.4%).

02. BGE-M3 leads in cross-lingual transfer with 20.7% Recall@1.

03. LaBSE attains 75.3% AUROC on binary compatibility classification.
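
The Recall@1 figures above count how often the correct instruction is the single top-ranked candidate for a given persona. A minimal sketch of that metric follows, assuming cosine similarity over precomputed embeddings. The function names and the toy 2-d vectors are illustrative and not taken from the paper, and the paper's exact scoring function may differ.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def recall_at_k(query_vecs, candidate_vecs, gold_indices, k=1):
    """Fraction of queries whose gold candidate is among the top-k by cosine similarity."""
    hits = 0
    for query, gold in zip(query_vecs, gold_indices):
        scores = [cosine(query, cand) for cand in candidate_vecs]
        # Indices of candidates sorted from most to least similar
        ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
        hits += gold in ranked[:k]
    return hits / len(query_vecs)

# Toy stand-ins for persona and instruction embeddings (made up for illustration)
personas = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
instructions = [[0.9, 0.1], [0.8, 0.3], [0.1, 0.9]]
gold = [0, 2, 0]  # hypothetical index of the matching instruction for each persona

print(recall_at_k(personas, instructions, gold, k=1))
```

With these toy vectors, the first two personas retrieve their gold instruction at rank 1 and the third does not, so Recall@1 is two out of three. Reverse retrieval (instruction to persona) would use the same function with the two embedding lists swapped.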

Abstract

Aligning multilingual assistants with culturally grounded user preferences is essential for serving India's linguistically diverse population of over one billion speakers across multiple scripts. However, existing benchmarks either focus on a single language or conflate retrieval with generation, leaving open the question of whether current embedding models can encode persona-instruction compatibility without relying on response synthesis. We present a unified benchmark spanning 12 Indian languages and four evaluation tasks: monolingual and cross-lingual persona-to-instruction retrieval, reverse retrieval from instruction to persona, and binary compatibility classification. Eight multilingual embedding models are evaluated in a frozen-encoder setting with a thin logistic regression head for classification. E5-Large-Instruct achieves the highest Recall@1 of 27.4% on monolingual…
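
The frozen-encoder classification setup mentioned in the abstract can be sketched roughly as follows: embeddings stay fixed, and only a logistic regression head is trained on a feature vector built from each persona-instruction pair, then scored with AUROC. This is a sketch under stated assumptions, not the authors' implementation. In particular, the element-wise product used as the pair feature, the plain gradient-descent trainer, and the toy data are all assumptions made for illustration.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def predict_proba(w, b, x):
    """Probability that the pair with feature vector x is compatible."""
    return sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)

def train_logistic_head(features, labels, lr=0.5, epochs=500):
    """Fit weights and bias by batch gradient descent on the logistic loss."""
    dim, n = len(features[0]), len(features)
    w, b = [0.0] * dim, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * dim, 0.0
        for x, y in zip(features, labels):
            err = predict_proba(w, b, x) - y
            for j in range(dim):
                grad_w[j] += err * x[j]
            grad_b += err
        w = [wi - lr * g / n for wi, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b

def auroc(scores, labels):
    """Probability that a random positive outscores a random negative (ties count half)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > q) + 0.5 * (p == q) for p in pos for q in neg)
    return wins / (len(pos) * len(neg))

def pair_feature(persona_vec, instruction_vec):
    # Assumption: element-wise product of the two frozen embeddings
    return [a * b for a, b in zip(persona_vec, instruction_vec)]

# Toy frozen "embeddings" and compatibility labels (made up for illustration)
pairs = [
    ([1.0, 0.0], [0.9, 0.1], 1),
    ([0.0, 1.0], [0.1, 0.9], 1),
    ([1.0, 0.0], [0.1, 0.9], 0),
    ([0.0, 1.0], [0.9, 0.1], 0),
]
features = [pair_feature(p, q) for p, q, _ in pairs]
labels = [y for _, _, y in pairs]

w, b = train_logistic_head(features, labels)
scores = [predict_proba(w, b, x) for x in features]
print(auroc(scores, labels))
```

The toy example scores the same pairs it was trained on, which is only meant to show the moving parts. A real evaluation would hold out test pairs and obtain the embeddings from one of the benchmarked encoders.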

Code & Models

Datasets

LingoIITGN/PI-Indic-Align
dataset · 70 dl

Taxonomy

Topics: Persona Design and Applications · Topic Modeling · Text Readability and Simplification