Diverse Image Priors for Black-box Data-free Knowledge Distillation

Tri-Nhan Vo; Dang Nguyen; Trung Le; Kien Do; Sunil Gupta

arXiv:2604.25794·cs.LG·April 29, 2026

Diverse Image Priors for Black-box Data-free Knowledge Distillation

Tri-Nhan Vo, Dang Nguyen, Trung Le, Kien Do, Sunil Gupta

PDF

TL;DR

DIP-KD is a novel framework for black-box, data-free knowledge distillation that leverages diverse image priors, contrastive learning, and a primer student to improve model transfer without access to original data.

Contribution

It introduces a three-phase collaborative pipeline that synthesizes diverse image priors, enhances synthetic data distinction, and employs a primer student for effective knowledge transfer.

Findings

01

Achieves state-of-the-art results across 12 benchmarks.

02

Data diversity is critical for effective knowledge distillation in restricted environments.

03

Contrastive learning enhances the distinction of synthetic samples.

Abstract

Knowledge distillation (KD) represents a vital mechanism to transfer expertise from complex teacher networks to efficient student models. However, in decentralized or secure AI ecosystems, privacy regulations and proprietary interests often restrict access to the teacher's interface and original datasets. These constraints define a challenging black-box data-free KD scenario where only top-1 predictions and no training data are available. While recent approaches utilize synthetic data, they still face limitations in data diversity and distillation signals. We propose Diverse Image Priors Knowledge Distillation (DIP-KD), a framework that addresses these challenges through a three-phase collaborative pipeline: (1) Synthesis of image priors to capture diverse visual patterns and semantics; (2) Contrast to enhance the collective distinction between synthetic samples via contrastive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.