First line of defense: A robust first layer mitigates adversarial attacks
Janani Suresh, Nancy Nayak, and Sheetal Kalyani

TL;DR
This paper introduces a robust first layer design for neural networks that acts as an implicit adversarial noise filter, improving adversarial robustness without the need for adversarial training.
Contribution
The authors propose a novel first layer architecture with large kernels, more filters, and maxpooling that enhances robustness across multiple models and datasets.
Findings
Achieves higher adversarial accuracy than existing natively robust models.
Produces smoother loss surfaces and better decision margins.
Attenuates high-frequency noise and improves denoising capabilities.
Abstract
Adversarial training (AT) incurs significant computational overhead, leading to growing interest in designing inherently robust architectures. We demonstrate that a carefully designed first layer of the neural network can serve as an implicit adversarial noise filter (ANF). This filter is created using a combination of large kernel size, increased convolution filters, and a maxpool operation. We show that integrating this filter as the first layer in architectures such as ResNet, VGG, and EfficientNet results in adversarially robust networks. Our approach achieves higher adversarial accuracies than existing natively robust architectures without AT and is competitive with adversarial-trained architectures across a wide range of datasets. Supporting our findings, we show that (a) the decision regions for our method have better margins, (b) the visualized loss surfaces are smoother, (c)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsForensic Fingerprint Detection Methods
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Depthwise Convolution · Sigmoid Activation · Pointwise Convolution · Global Average Pooling · Kaiming Initialization · Depthwise Separable Convolution · 1x1 Convolution · Dense Connections · Squeeze-and-Excitation Block
