Efficient Ensemble for Fine-tuning Language Models on Multiple Datasets
Dongyue Li, Ziniu Zhang, Lu Wang, and Hongyang R. Zhang

TL;DR
This paper introduces an efficient ensemble method using multiple small adapters for fine-tuning large language models on multiple datasets, significantly reducing computation while improving accuracy.
Contribution
It proposes a novel ensemble approach with dataset grouping and a first-order approximation for performance estimation, enabling faster and more accurate multi-dataset fine-tuning.
Findings
Achieves up to 10% higher accuracy than QLoRA on text classification tasks.
Reduces fine-tuning computation by over 100 times using approximation methods.
Improves test accuracy by 3% on a 34-billion-parameter Llama model.
Abstract
This paper develops an ensemble method for fine-tuning a language model to multiple datasets. Existing methods, such as quantized LoRA (QLoRA), are efficient when adapting to a single dataset. When training on multiple datasets of different tasks, a common setup in practice, it remains unclear how to design an efficient adaptation for fine-tuning language models. We propose to use an ensemble of multiple smaller adapters instead of a single adapter per task. We design an efficient algorithm that partitions datasets into groups, where is typically much smaller than in practice, and train one adapter for each group before taking a weighted combination to form the ensemble. The algorithm leverages a first-order approximation property of low-rank adaptation to quickly obtain the fine-tuning performances of dataset combinations since methods like LoRA stay close to the base…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Machine Learning and Data Classification · Natural Language Processing Techniques
