Role of Language Relatedness in Multilingual Fine-tuning of Language Models: A Case Study in Indo-Aryan Languages
Tejas Indulal Dhamecha, Rudra Murthy V, Samarth Bharadwaj, Karthik, Sankaranarayanan, Pushpak Bhattacharyya

TL;DR
This study demonstrates that multilingual fine-tuning of language models, especially with carefully selected related languages, significantly improves NLP task performance in Indo-Aryan languages, with benefits for low-resource languages.
Contribution
It provides the first detailed analysis of how language relatedness affects multilingual fine-tuning performance, highlighting the importance of selective language inclusion.
Findings
Multilingual fine-tuning improves downstream NLP tasks by up to 150%.
Careful selection of related languages outperforms using all related languages.
Low-resource languages benefit most from multilingual fine-tuning.
Abstract
We explore the impact of leveraging the relatedness of languages that belong to the same family in NLP models using multilingual fine-tuning. We hypothesize and validate that multilingual fine-tuning of pre-trained language models can yield better performance on downstream NLP applications, compared to models fine-tuned on individual languages. A first of its kind detailed study is presented to track performance change as languages are added to a base language in a graded and greedy (in the sense of best boost of performance) manner; which reveals that careful selection of subset of related languages can significantly improve performance than utilizing all related languages. The Indo-Aryan (IA) language family is chosen for the study, the exact languages being Bengali, Gujarati, Hindi, Marathi, Oriya, Punjabi and Urdu. The script barrier is crossed by simple rule-based transliteration…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsTest · mBERT
