Towards Few-Call Model Stealing via Active Self-Paced Knowledge   Distillation and Diffusion-Based Image Generation

Vlad Hondru; Radu Tudor Ionescu

arXiv:2310.00096·cs.CV·March 17, 2025·1 cites

Towards Few-Call Model Stealing via Active Self-Paced Knowledge Distillation and Diffusion-Based Image Generation

Vlad Hondru, Radu Tudor Ionescu

PDF

Open Access

TL;DR

This paper introduces a novel framework for few-call model stealing that leverages diffusion-generated synthetic data, active self-paced learning, and knowledge distillation to effectively copy black-box models without access to training data or architecture.

Contribution

The paper proposes a new approach combining diffusion models, active self-paced learning, and knowledge distillation for efficient model stealing under limited API calls.

Findings

01

Outperforms four state-of-the-art methods in few-call scenarios

02

Effective use of diffusion-generated data enhances model extraction

03

Framework works across multiple datasets

Abstract

Diffusion models showcase strong capabilities in image synthesis, being used in many computer vision tasks with great success. To this end, we propose to explore a new use case, namely to copy black-box classification models without having access to the original training data, the architecture, and the weights of the model, i.e. the model is only exposed through an inference API. More specifically, we can only observe the (soft or hard) labels for some image samples passed as input to the model. Furthermore, we consider an additional constraint limiting the number of model calls, mostly focusing our research on few-call model stealing. In order to solve the model extraction task given the applied restrictions, we propose the following framework. As training data, we create a synthetic data set (called proxy data set) by leveraging the ability of diffusion models to generate realistic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Image Processing Techniques and Applications · Domain Adaptation and Few-Shot Learning

MethodsDiffusion