Data-Free Model Extraction

Jean-Baptiste Truong; Pratyush Maini; Robert J. Walls; Nicolas; Papernot

arXiv:2011.14779·cs.LG·April 1, 2021

Data-Free Model Extraction

Jean-Baptiste Truong, Pratyush Maini, Robert J. Walls, Nicolas, Papernot

PDF

2 Repos

TL;DR

This paper introduces data-free model extraction techniques that do not require surrogate datasets, enabling high-accuracy replication of victim models in black-box settings with limited data access.

Contribution

It presents novel data-free extraction methods adapting knowledge transfer techniques, addressing challenges of limited access and no surrogate data.

Findings

01

Achieves 99% and 92% accuracy on SVHN and CIFAR-10

02

Uses 2 million and 20 million queries respectively

03

High accuracy with reasonable query complexity

Abstract

Current model extraction attacks assume that the adversary has access to a surrogate dataset with characteristics similar to the proprietary data used to train the victim model. This requirement precludes the use of existing model extraction techniques on valuable models, such as those trained on rare or hard to acquire datasets. In contrast, we propose data-free model extraction methods that do not require a surrogate dataset. Our approach adapts techniques from the area of data-free knowledge transfer for model extraction. As part of our study, we identify that the choice of loss is critical to ensuring that the extracted model is an accurate replica of the victim model. Furthermore, we address difficulties arising from the adversary's limited access to the victim model in a black-box setting. For example, we recover the model's logits from its probability predictions to approximate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.