Improving Diversity in Black-box Few-shot Knowledge Distillation

Tri-Nhan Vo; Dang Nguyen; Kien Do; Sunil Gupta

arXiv:2604.25795·cs.CV·April 29, 2026

Improving Diversity in Black-box Few-shot Knowledge Distillation

Tri-Nhan Vo, Dang Nguyen, Kien Do, Sunil Gupta

PDF

1 Repo

TL;DR

This paper introduces a novel adaptive image selection method for black-box few-shot knowledge distillation, significantly enhancing diversity and accuracy of the student model with limited data.

Contribution

It proposes an active, on-the-fly image selection strategy within GAN training to improve diversity in few-shot knowledge distillation.

Findings

01

Achieves state-of-the-art results on seven image datasets.

02

Significantly improves student accuracy with limited training data.

03

Enhances diversity of synthetic images for better knowledge transfer.

Abstract

Knowledge distillation (KD) is a well-known technique to effectively compress a large network (teacher) to a smaller network (student) with little sacrifice in performance. However, most KD methods require a large training set and internal access to the teacher, which are rarely available due to various restrictions. These challenges have originated a more practical setting known as black-box few-shot KD, where the student is trained with few images and a black-box teacher. Recent approaches typically generate additional synthetic images but lack an active strategy to promote their diversity, a crucial factor for student learning. To address these problems, we propose a novel training scheme for generative adversarial networks, where we adaptively select high-confidence images under the teacher's supervision and introduce them to the adversarial learning on-the-fly. Our approach helps…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

votrinhan88/divbfkd
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.