Birbal: An efficient 7B instruct-model fine-tuned with curated datasets

Ashvini Kumar Jindal; Pawan Kumar Rajpoot; Ankur Parikh

arXiv:2403.02247·cs.CL·March 5, 2024·2 cites

Birbal: An efficient 7B instruct-model fine-tuned with curated datasets

Ashvini Kumar Jindal, Pawan Kumar Rajpoot, Ankur Parikh

PDF

Open Access 1 Repo

TL;DR

Birbal is a fine-tuned 7B instruction model that achieves significant performance gains using a cost-effective, single-GPU setup and curated datasets, demonstrating efficient and transparent large language model adaptation.

Contribution

We introduce Birbal, a 7B instruction-tuned model trained on curated datasets with minimal resources, showcasing improved performance and transparency in LLM fine-tuning.

Findings

01

35% performance improvement over Qwen-14B

02

Fine-tuned on a single GPU within 16 hours

03

Curated diverse high-quality instruction datasets

Abstract

LLMOps incur significant costs due to hardware requirements, hindering their widespread accessibility. Additionally, a lack of transparency in model training methods and data contributes to the majority of models being non-reproducible. To tackle these challenges, the LLM Efficiency Challenge was introduced at NeurIPS Workshop, aiming to adapt foundation models on a diverse set of tasks via fine-tuning on a single GPU (RTX 4090 or A100 with 40GB) within a 24-hour timeframe. In this system description paper, we introduce Birbal, our Mistral-7B based winning model, fine-tuned on a single RTX 4090 for 16 hours. Birbal's success lies in curating high-quality instructions covering diverse tasks, resulting in a 35% performance improvement over second-best Qwen-14B based submission.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

upaya07/neurips-llm-efficiency-challenge
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis

MethodsSparse Evolutionary Training