LAB: Large-Scale Alignment for ChatBots
Shivchander Sudalairaj, Abhishek Bhandwaldar, Aldo Pareja, Kai Xu,, David D. Cox, Akash Srivastava

TL;DR
LAB introduces a scalable, cost-effective method for training large language models using synthetic data generation and multi-phase tuning, reducing dependence on costly human annotations and proprietary models.
Contribution
The paper presents LAB, a novel scalable framework for instruction tuning of LLMs that leverages taxonomy-guided synthetic data and multi-phase training to improve efficiency and performance.
Findings
LAB-trained models perform competitively on benchmarks.
Reduces reliance on expensive human annotations.
Achieves scalable instruction-following capabilities.
Abstract
This work introduces LAB (Large-scale Alignment for chatBots), a novel methodology designed to overcome the scalability challenges in the instruction-tuning phase of large language model (LLM) training. Leveraging a taxonomy-guided synthetic data generation process and a multi-phase tuning framework, LAB significantly reduces reliance on expensive human annotations and proprietary models like GPT-4. We demonstrate that LAB-trained models can achieve competitive performance across several benchmarks compared to models trained with traditional human-annotated or GPT-4 generated synthetic data. Thus offering a scalable, cost-effective solution for enhancing LLM capabilities and instruction-following behaviors without the drawbacks of catastrophic forgetting, marking a step forward in the efficient training of LLMs for a wide range of applications.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗ibm-research/labradorite-13bmodel· 10 dl· ♡ 7510 dl♡ 75
- 🤗ibm-research/merlinite-7bmodel· 228 dl· ♡ 105228 dl♡ 105
- 🤗solidrust/merlinite-7b-AWQmodel· 4 dl4 dl
- 🤗MaziyarPanahi/merlinite-7b-GGUFmodel· 88 dl· ♡ 588 dl♡ 5
- 🤗instructlab/merlinite-7b-labmodel· 99 dl· ♡ 2299 dl♡ 22
- 🤗instructlab/granite-7b-labmodel· 473 dl· ♡ 43473 dl♡ 43
- 🤗RichardErkhov/ibm_-_merlinite-7b-8bitsmodel
- 🤗RichardErkhov/ibm_-_merlinite-7b-ggufmodel· 100 dl100 dl
- 🤗RichardErkhov/instructlab_-_merlinite-7b-lab-4bitsmodel· 1 dl1 dl
- 🤗RichardErkhov/instructlab_-_merlinite-7b-lab-8bitsmodel· 3 dl3 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · AI in Service Interactions
MethodsAttention Is All You Need · Linear Layer · Byte Pair Encoding · Multi-Head Attention · Layer Normalization · Dropout · Softmax · Dense Connections · Label Smoothing · Adam
