Adversarial training may be a double-edged sword

Ali Rahmati; Seyed-Mohsen Moosavi-Dezfooli; Huaiyu Dai

arXiv:2107.11671·cs.LG·July 27, 2021

Adversarial training may be a double-edged sword

Ali Rahmati, Seyed-Mohsen Moosavi-Dezfooli, Huaiyu Dai

PDF

Open Access

TL;DR

This paper investigates the limitations of adversarial training, revealing that it can inadvertently favor certain black-box attacks and may not always enhance robustness against all attack types, especially decision-based black-box attacks.

Contribution

It introduces a metric called robustness gain and analyzes how adversarial training affects the decision boundary and attack convergence in deep networks.

Findings

01

Adversarial training improves white-box robustness significantly.

02

It offers limited robustness gains against decision-based black-box attacks.

03

White-box attacks converge faster on adversarially-trained networks.

Abstract

Adversarial training has been shown as an effective approach to improve the robustness of image classifiers against white-box attacks. However, its effectiveness against black-box attacks is more nuanced. In this work, we demonstrate that some geometric consequences of adversarial training on the decision boundary of deep networks give an edge to certain types of black-box attacks. In particular, we define a metric called robustness gain to show that while adversarial training is an effective method to dramatically improve the robustness in white-box scenarios, it may not provide such a good robustness gain against the more realistic decision-based black-box attacks. Moreover, we show that even the minimal perturbation white-box attacks can converge faster against adversarially-trained neural networks compared to the regular ones.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Bacillus and Francisella bacterial research · Anomaly Detection Techniques and Applications