Black-Box Ripper: Copying black-box models using generative evolutionary algorithms
Antonio Barbalau, Adrian Cosma, Radu Tudor Ionescu, Marius Popescu

TL;DR
This paper introduces a novel method to replicate black-box neural models by generating data samples using evolutionary strategies and training a student model, achieving superior performance without access to training data or back-propagation.
Contribution
It presents a teacher-student framework that distills black-box models by generating high-response data samples through evolutionary algorithms, outperforming existing methods.
Findings
Our method surpasses state-of-the-art black-box model replication techniques.
It effectively generates data samples that elicit high responses from the black-box model.
The approach maintains high accuracy in the student model despite no access to training data or back-propagation.
Abstract
We study the task of replicating the functionality of black-box neural models, for which we only know the output class probabilities provided for a set of input images. We assume back-propagation through the black-box model is not possible and its training images are not available, e.g. the model could be exposed only through an API. In this context, we present a teacher-student framework that can distill the black-box (teacher) model into a student model with minimal accuracy loss. To generate useful data samples for training the student, our framework (i) learns to generate images on a proxy data set (with images and classes different from those used to train the black-box) and (ii) applies an evolutionary strategy to make sure that each generated data sample exhibits a high response for a specific class when given as input to the black box. Our framework is compared with several…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Neural Network Applications · Adversarial Robustness in Machine Learning
