CaBaGe: Data-Free Model Extraction using ClAss BAlanced Generator Ensemble
Jonathan Rosenthal, Shanchao Liang, Kevin Zhang, Lin Tan

TL;DR
CaBaGe is a novel data-free model extraction method that uses ensemble generators and selective querying to accurately replicate black-box models with fewer queries, even without prior knowledge of class numbers.
Contribution
It introduces a new data-free extraction approach with ensemble generators and adaptive filtering, addressing realistic scenarios where class information is unknown.
Findings
Outperforms existing methods on seven datasets.
Achieves up to 43.13% accuracy improvement in extracted models.
Reduces query count by up to 75.7%.
Abstract
Machine Learning as a Service (MLaaS) is often provided as a pay-per-query, black-box system to clients. Such a black-box approach not only hinders open replication, validation, and interpretation of model results, but also makes it harder for white-hat researchers to identify vulnerabilities in the MLaaS systems. Model extraction is a promising technique to address these challenges by reverse-engineering black-box models. Since training data is typically unavailable for MLaaS models, this paper focuses on the realistic version of it: data-free model extraction. We propose a data-free model extraction approach, CaBaGe, to achieve higher model extraction accuracy with a small number of queries. Our innovations include (1) a novel experience replay for focusing on difficult training samples; (2) an ensemble of generators for steadily producing diverse synthetic data; and (3) a selective…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSAS software applications and methods · Software Engineering Research · Engineering and Information Technology
Methodstravel james · Experience Replay
