Towards Few-Call Model Stealing via Active Self-Paced Knowledge Distillation and Diffusion-Based Image Generation
Vlad Hondru, Radu Tudor Ionescu

TL;DR
This paper introduces a novel framework for few-call model stealing that leverages diffusion-generated synthetic data, active self-paced learning, and knowledge distillation to effectively copy black-box models without access to training data or architecture.
Contribution
The paper proposes a new approach combining diffusion models, active self-paced learning, and knowledge distillation for efficient model stealing under limited API calls.
Findings
Outperforms four state-of-the-art methods in few-call scenarios
Effective use of diffusion-generated data enhances model extraction
Framework works across multiple datasets
Abstract
Diffusion models showcase strong capabilities in image synthesis, being used in many computer vision tasks with great success. To this end, we propose to explore a new use case, namely to copy black-box classification models without having access to the original training data, the architecture, and the weights of the model, i.e. the model is only exposed through an inference API. More specifically, we can only observe the (soft or hard) labels for some image samples passed as input to the model. Furthermore, we consider an additional constraint limiting the number of model calls, mostly focusing our research on few-call model stealing. In order to solve the model extraction task given the applied restrictions, we propose the following framework. As training data, we create a synthetic data set (called proxy data set) by leveraging the ability of diffusion models to generate realistic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Image Processing Techniques and Applications · Domain Adaptation and Few-Shot Learning
MethodsDiffusion
