IndicXNLI: Evaluating Multilingual Inference for Indian Languages

Divyanshu Aggarwal; Vivek Gupta; Anoop Kunchukuttan

arXiv:2204.08776·cs.CL·April 20, 2022·5 cites

IndicXNLI: Evaluating Multilingual Inference for Indian Languages

Divyanshu Aggarwal, Vivek Gupta, Anoop Kunchukuttan

PDF

Open Access 1 Repo 4 Datasets

TL;DR

This paper introduces IndicXNLI, a new benchmark dataset for natural language inference in 11 Indian languages, created through high-quality machine translation of the English XNLI dataset, to evaluate multilingual models.

Contribution

The paper presents IndicXNLI, a novel NLI dataset for Indian languages, and analyzes cross-lingual transfer techniques using various pre-trained language models.

Findings

01

Insights into model performance across languages

02

Impact of multi-linguality and input mixing

03

Evaluation of transfer techniques

Abstract

While Indic NLP has made rapid advances recently in terms of the availability of corpora and pre-trained models, benchmark datasets on standard NLU tasks are limited. To this end, we introduce IndicXNLI, an NLI dataset for 11 Indic languages. It has been created by high-quality machine translation of the original English XNLI dataset and our analysis attests to the quality of IndicXNLI. By finetuning different pre-trained LMs on this IndicXNLI, we analyze various cross-lingual transfer techniques with respect to the impact of the choice of language models, languages, multi-linguality, mix-language input, etc. These experiments provide us with useful insights into the behaviour of pre-trained models for a diverse set of languages.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

divyanshuaggarwal/indicxnli
pytorchOfficial

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications