Airavata: Introducing Hindi Instruction-tuned LLM

Jay Gala; Thanmay Jayakumar; Jaavid Aktar Husain; Aswanth Kumar M,; Mohammed Safi Ur Rahman Khan; Diptesh Kanojia; Ratish Puduppully; Mitesh M.; Khapra; Raj Dabre; Rudra Murthy; Anoop Kunchukuttan

arXiv:2401.15006·cs.CL·February 27, 2024·2 cites

Airavata: Introducing Hindi Instruction-tuned LLM

Jay Gala, Thanmay Jayakumar, Jaavid Aktar Husain, Aswanth Kumar M,, Mohammed Safi Ur Rahman Khan, Diptesh Kanojia, Ratish Puduppully, Mitesh M., Khapra, Raj Dabre, Rudra Murthy, Anoop Kunchukuttan

PDF

Open Access 1 Repo 4 Models 2 Datasets

TL;DR

Airavata is an instruction-tuned large language model for Hindi, created through fine-tuning on diverse datasets, with accompanying benchmarks and datasets to advance research in Indic languages.

Contribution

The paper introduces Airavata, a novel Hindi instruction-tuned LLM, along with the IndicInstruct dataset and evaluation framework, facilitating further research in Indic language models.

Findings

01

Airavata performs well on Hindi assistive tasks.

02

IndicInstruct dataset enables diverse instruction tuning.

03

Framework allows comprehensive evaluation of Hindi LLMs.

Abstract

We announce the initial release of "Airavata," an instruction-tuned LLM for Hindi. Airavata was created by fine-tuning OpenHathi with diverse, instruction-tuning Hindi datasets to make it better suited for assistive tasks. Along with the model, we also share the IndicInstruct dataset, which is a collection of diverse instruction-tuning datasets to enable further research for Indic LLMs. Additionally, we present evaluation benchmarks and a framework for assessing LLM performance across tasks in Hindi. Currently, Airavata supports Hindi, but we plan to expand this to all 22 scheduled Indic languages. You can access all artifacts at https://ai4bharat.github.io/airavata.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ai4bharat/indicinstruct
pytorchOfficial

Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Multimodal Machine Learning Applications · Topic Modeling