"What's in the box?!": Deflecting Adversarial Attacks by Randomly   Deploying Adversarially-Disjoint Models

Sahar Abdelnabi; Mario Fritz

arXiv:2102.05104·cs.LG·March 10, 2021·1 cites

"What's in the box?!": Deflecting Adversarial Attacks by Randomly Deploying Adversarially-Disjoint Models

Sahar Abdelnabi, Mario Fritz

PDF

Open Access

TL;DR

This paper introduces a novel defense strategy against adversarial attacks by deploying multiple adversarially-disjoint models randomly, significantly reducing attack transferability and improving robustness without sacrificing clean data accuracy.

Contribution

Proposes a deployment-based defense using adversarially-disjoint models that minimizes attack transferability and enhances robustness over traditional ensemble methods.

Findings

01

Lower attack transferability across models compared to ensemble diversity.

02

Higher average robust accuracy than adversarially trained sets.

03

Maintains accuracy on clean examples.

Abstract

Machine learning models are now widely deployed in real-world applications. However, the existence of adversarial examples has been long considered a real threat to such models. While numerous defenses aiming to improve the robustness have been proposed, many have been shown ineffective. As these vulnerabilities are still nowhere near being eliminated, we propose an alternative deployment-based defense paradigm that goes beyond the traditional white-box and black-box threat models. Instead of training a single partially-robust model, one could train a set of same-functionality, yet, adversarially-disjoint models with minimal in-between attack transferability. These models could then be randomly and individually deployed, such that accessing one of them minimally affects the others. Our experiments on CIFAR-10 and a wide range of attacks show that we achieve a significantly lower attack…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Cardiac Arrest and Resuscitation · Advanced Malware Detection Techniques