Isolation and Induction: Training Robust Deep Neural Networks against   Model Stealing Attacks

Jun Guo; Aishan Liu; Xingyu Zheng; Siyuan Liang; Yisong Xiao; Yichao; Wu; Xianglong Liu

arXiv:2308.00958·cs.CR·August 4, 2023

Isolation and Induction: Training Robust Deep Neural Networks against Model Stealing Attacks

Jun Guo, Aishan Liu, Xingyu Zheng, Siyuan Liang, Yisong Xiao, Yichao, Wu, Xianglong Liu

PDF

Open Access 1 Repo

TL;DR

This paper introduces Isolation and Induction (InI), a training framework that enhances the robustness of deep neural networks against model stealing attacks by reducing inference costs and producing uninformative outputs for stealing queries.

Contribution

InI is a novel training method that isolates adversarial gradients and induces uninformative outputs, improving robustness without auxiliary modules or high computational overhead.

Findings

01

Reduces model stealing accuracy by up to 48%.

02

Speeds up inference by up to 25.4 times.

03

Maintains benign model accuracy while defending against attacks.

Abstract

Despite the broad application of Machine Learning models as a Service (MLaaS), they are vulnerable to model stealing attacks. These attacks can replicate the model functionality by using the black-box query process without any prior knowledge of the target victim model. Existing stealing defenses add deceptive perturbations to the victim's posterior probabilities to mislead the attackers. However, these defenses are now suffering problems of high inference computational overheads and unfavorable trade-offs between benign accuracy and stealing robustness, which challenges the feasibility of deployed models in practice. To address the problems, this paper proposes Isolation and Induction (InI), a novel and effective training framework for model stealing defenses. Instead of deploying auxiliary defense modules that introduce redundant inference time, InI directly trains a defensive model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dig-beihang/ini-model-stealing-defense
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications

Methodstravel james · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings