The Efficacy of SHIELD under Different Threat Models

Cory Cornelius; Nilaksh Das; Shang-Tse Chen; Li Chen; Michael E.; Kounavis; Duen Horng Chau

arXiv:1902.00541·cs.LG·August 6, 2019·5 cites

The Efficacy of SHIELD under Different Threat Models

Cory Cornelius, Nilaksh Das, Shang-Tse Chen, Li Chen, Michael E., Kounavis, Duen Horng Chau

PDF

Open Access

TL;DR

This paper evaluates the robustness of the SHIELD defense against adaptive adversaries across various threat models, revealing that training models from scratch enhances defense effectiveness and reduces attack success rates.

Contribution

It introduces an analysis of SHIELD under different threat models, including adaptive attackers, and compares the impact of training ensemble models from scratch versus re-training.

Findings

01

Targeted PGD attack success rate drops from 64.3% to 48.9% when models are trained from scratch.

02

Ensembles with models trained from scratch are less vulnerable to white-box and gray-box attacks.

03

Models trained from scratch show lower correlation in cosine similarity space, enhancing robustness.

Abstract

In this appraisal paper, we evaluate the efficacy of SHIELD, a compression-based defense framework for countering adversarial attacks on image classification models, which was published at KDD 2018. Here, we consider alternative threat models not studied in the original work, where we assume that an adaptive adversary is aware of the ensemble defense approach, the defensive pre-processing, and the architecture and weights of the models used in the ensemble. We define scenarios with varying levels of threat and empirically analyze the proposed defense by varying the degree of information available to the attacker, spanning from a full white-box attack to the gray-box threat model described in the original work. To evaluate the robustness of the defense against an adaptive attacker, we consider the targeted-attack success rate of the Projected Gradient Descent (PGD) attack, which is a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Bacillus and Francisella bacterial research