Few-Shot Knowledge Distillation of LLMs With Counterfactual Explanations

Faisal Hamman; Pasan Dissanayake; Yanjun Fu; Sanghamitra Dutta

arXiv:2510.21631·cs.LG·October 27, 2025

TL;DR

This paper introduces CoD, a novel few-shot knowledge distillation method that uses counterfactual explanations to efficiently transfer knowledge from large models to smaller ones with minimal data.

Contribution

The paper proposes a new distillation strategy leveraging counterfactual explanations to improve knowledge transfer in few-shot settings, with theoretical and empirical validation.

Findings

01 CoD outperforms standard distillation methods in few-shot settings.

02 Using CFEs improves the student's approximation of the teacher's decision boundary.

03 The method is effective across various datasets and LLMs.

Abstract

Knowledge distillation is a promising approach to transfer capabilities from complex teacher models to smaller, resource-efficient student models that can be deployed easily, particularly in task-aware scenarios. However, existing methods of task-aware distillation typically require substantial quantities of data, which may be unavailable or expensive to obtain in many practical scenarios. In this paper, we address this challenge by introducing a novel strategy called Counterfactual-explanation-infused Distillation (CoD) for few-shot task-aware knowledge distillation by systematically infusing counterfactual explanations. Counterfactual explanations (CFEs) refer to inputs that can flip the output prediction of the teacher model with minimum perturbation. Our strategy CoD leverages these CFEs to precisely map the teacher's decision boundary with significantly fewer samples. We provide…
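The idea of pairing each few-shot sample with a counterfactual that sits just across the teacher's decision boundary can be illustrated with a toy sketch. It assumes a linear logistic teacher in two dimensions rather than an LLM, and the names `teacher_prob`, `linear_cfe`, and `distill` are hypothetical. The closed-form counterfactual used here holds only for linear models, and the plain soft-label cross-entropy stands in for whatever objective the paper uses, which this excerpt does not describe.

```python
import numpy as np


def teacher_prob(x, w, b):
    """Toy teacher (assumption): logistic model returning P(y=1 | x)."""
    return 1.0 / (1.0 + np.exp(-(x @ w + b)))


def linear_cfe(x, w, b, overshoot=0.05):
    """Nearest point (in L2) just across the boundary w.x + b = 0.

    This closed form is valid only for a linear teacher; it is a stand-in
    for a real counterfactual generator.
    """
    margin = (x @ w + b) / np.dot(w, w)
    # Step along w far enough to land slightly past the boundary.
    return x - (1.0 + overshoot) * margin * w


def distill(X, soft_labels, steps=2000, lr=0.5):
    """Fit a logistic student to teacher soft labels by gradient descent."""
    w = np.zeros(X.shape[1])
    b = 0.0
    for _ in range(steps):
        p = 1.0 / (1.0 + np.exp(-(X @ w + b)))
        grad = p - soft_labels  # gradient of cross-entropy w.r.t. the logits
        w -= lr * (X.T @ grad) / len(X)
        b -= lr * grad.mean()
    return w, b


rng = np.random.default_rng(0)
w_t, b_t = np.array([2.0, -1.0]), 0.5          # hypothetical teacher parameters
X_few = rng.normal(size=(8, 2))                # the few-shot samples
X_cfe = np.array([linear_cfe(x, w_t, b_t) for x in X_few])
X_aug = np.vstack([X_few, X_cfe])              # samples plus their counterfactuals
w_s, b_s = distill(X_aug, teacher_prob(X_aug, w_t, b_t))
```

Each counterfactual lies close to its original sample but receives the opposite teacher prediction, so every pair brackets the boundary; this is the intuition behind mapping the boundary with few samples.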

Videos

Few-Shot Knowledge Distillation of LLMs With Counterfactual Explanations · SlidesLive