LLM on a Budget: Active Knowledge Distillation for Efficient Classification of Large Text Corpora
Viviana Luccioli, Rithika Iyengar, Ryan Panley, Flora Haberkorn, Xiaoyu Ge, Leland Crane, Nitish Sinha, Seung Jung Lee

TL;DR
This paper introduces M-RARU, an active learning algorithm that reduces the cost of knowledge distillation for large language models by selecting only the most informative data points, achieving significant savings and maintaining high accuracy.
Contribution
The paper proposes M-RARU, a novel active learning method combining uncertainty and randomized accept-reject strategies to efficiently distill large models with fewer samples.
Findings
Achieves up to 80% reduction in sample requirements compared to random sampling.
Maintains high classification accuracy with significantly lower costs.
Reduces training time and API calls in knowledge distillation process.
Abstract
Large Language Models (LLMs) are highly accurate in classification tasks, however, substantial computational and financial costs hinder their large-scale deployment in dynamic environments. Knowledge Distillation (KD) where a LLM "teacher" trains a smaller and more efficient "student" model, offers a promising solution to this problem. However, the distillation process itself often remains costly for large datasets, since it requires the teacher to label a vast number of samples while incurring significant token consumption. To alleviate this challenge, in this work we explore the active learning (AL) as a way to create efficient student models at a fraction of the cost while preserving the LLM's performance. In particular, we introduce M-RARU (Multi-class Randomized Accept/Reject Uncertainty Sampling), a novel AL algorithm that significantly reduces training costs. M-RARU employs an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Topic Modeling · Machine Learning in Materials Science
