Identifying Layers Susceptible to Adversarial Attacks

Shoaib Ahmed Siddiqui; Thomas Breuel

arXiv:2107.04827·cs.LG·November 1, 2021

Identifying Layers Susceptible to Adversarial Attacks

Shoaib Ahmed Siddiqui, Thomas Breuel

PDF

Open Access

TL;DR

This study explores how different layers of neural networks contribute to vulnerability against adversarial attacks, revealing that early layers are more susceptible and crucial for robustness.

Contribution

It demonstrates that robustness is primarily linked to low-level feature extraction layers, challenging the focus on high-level layers for defense strategies.

Findings

01

Susceptibility to adversarial samples is linked to early layers.

02

Retraining high-level layers alone is insufficient for robustness.

03

Adversarial samples produce statistically different features in early layers.

Abstract

In this paper, we investigate the use of pretraining with adversarial networks, with the objective of discovering the relationship between network depth and robustness. For this purpose, we selectively retrain different portions of VGG and ResNet architectures on CIFAR-10, Imagenette, and ImageNet using non-adversarial and adversarial data. Experimental results show that susceptibility to adversarial samples is associated with low-level feature extraction layers. Therefore, retraining of high-level layers is insufficient for achieving robustness. Furthermore, adversarial attacks yield outputs from early layers that differ statistically from features for non-adversarial samples and do not permit consistent classification by subsequent layers. This supports common hypotheses regarding the association of robustness with the feature extractor, insufficiency of deeper layers in providing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Integrated Circuits and Semiconductor Failure Analysis

Methods*Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization · Residual Connection · Average Pooling · Global Average Pooling · 1x1 Convolution · Kaiming Initialization · Residual Block · Bottleneck Residual Block · Max Pooling