TL;DR
This paper introduces a novel adaptive image selection method for black-box few-shot knowledge distillation, significantly enhancing diversity and accuracy of the student model with limited data.
Contribution
It proposes an active, on-the-fly image selection strategy within GAN training to improve diversity in few-shot knowledge distillation.
Findings
Achieves state-of-the-art results on seven image datasets.
Significantly improves student accuracy with limited training data.
Enhances diversity of synthetic images for better knowledge transfer.
Abstract
Knowledge distillation (KD) is a well-known technique to effectively compress a large network (teacher) to a smaller network (student) with little sacrifice in performance. However, most KD methods require a large training set and internal access to the teacher, which are rarely available due to various restrictions. These challenges have originated a more practical setting known as black-box few-shot KD, where the student is trained with few images and a black-box teacher. Recent approaches typically generate additional synthetic images but lack an active strategy to promote their diversity, a crucial factor for student learning. To address these problems, we propose a novel training scheme for generative adversarial networks, where we adaptively select high-confidence images under the teacher's supervision and introduce them to the adversarial learning on-the-fly. Our approach helps…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
