A Protection against the Extraction of Neural Network Models

Herv\'e Chabanne; Vincent Despiegel; Linda Guiga

arXiv:2005.12782·cs.LG·August 3, 2020·1 cites

A Protection against the Extraction of Neural Network Models

Herv\'e Chabanne, Vincent Despiegel, Linda Guiga

PDF

Open Access

TL;DR

This paper proposes a novel protection method against neural network model extraction attacks by adding parasitic layers that preserve performance while complicating reverse-engineering efforts.

Contribution

It introduces a new defense mechanism using parasitic layers and explains why this approach increases the difficulty of model extraction attacks.

Findings

01

Parasitic layers maintain the original model's predictions.

02

The protection method effectively complicates model extraction.

03

Experimental results show minimal impact on accuracy.

Abstract

Given oracle access to a Neural Network (NN), it is possible to extract its underlying model. We here introduce a protection by adding parasitic layers which keep the underlying NN's predictions mostly unchanged while complexifying the task of reverse-engineering. Our countermeasure relies on approximating a noisy identity mapping with a Convolutional NN. We explain why the introduction of new parasitic layers complexifies the attacks. We report experiments regarding the performance and the accuracy of the protected NN.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning