Isolation and Induction: Training Robust Deep Neural Networks against Model Stealing Attacks
Jun Guo, Aishan Liu, Xingyu Zheng, Siyuan Liang, Yisong Xiao, Yichao, Wu, Xianglong Liu

TL;DR
This paper introduces Isolation and Induction (InI), a training framework that enhances the robustness of deep neural networks against model stealing attacks by reducing inference costs and producing uninformative outputs for stealing queries.
Contribution
InI is a novel training method that isolates adversarial gradients and induces uninformative outputs, improving robustness without auxiliary modules or high computational overhead.
Findings
Reduces model stealing accuracy by up to 48%.
Speeds up inference by up to 25.4 times.
Maintains benign model accuracy while defending against attacks.
Abstract
Despite the broad application of Machine Learning models as a Service (MLaaS), they are vulnerable to model stealing attacks. These attacks can replicate the model functionality by using the black-box query process without any prior knowledge of the target victim model. Existing stealing defenses add deceptive perturbations to the victim's posterior probabilities to mislead the attackers. However, these defenses are now suffering problems of high inference computational overheads and unfavorable trade-offs between benign accuracy and stealing robustness, which challenges the feasibility of deployed models in practice. To address the problems, this paper proposes Isolation and Induction (InI), a novel and effective training framework for model stealing defenses. Instead of deploying auxiliary defense modules that introduce redundant inference time, InI directly trains a defensive model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications
Methodstravel james · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
