Sparta: Spatially Attentive and Adversarially Robust Activation
Qing Guo, Felix Juefei-Xu, Changqing Zhou, Wei Feng, Yang Liu, Song, Wang

TL;DR
This paper introduces Sparta, a novel activation function designed to enhance the adversarial robustness and accuracy of CNNs by addressing ReLU's limitations, demonstrating superior transferability and robustness across models and datasets.
Contribution
The paper proposes Sparta, a new activation function that improves adversarial robustness and accuracy of CNNs, with better transferability than existing functions.
Findings
Sparta achieves higher robustness and accuracy than ReLU.
Sparta exhibits superior transferability across CNN architectures.
Sparta trained on one dataset effectively improves robustness on another dataset.
Abstract
Adversarial training (AT) is one of the most effective ways for improving the robustness of deep convolution neural networks (CNNs). Just like common network training, the effectiveness of AT relies on the design of basic network components. In this paper, we conduct an in-depth study on the role of the basic ReLU activation component in AT for robust CNNs. We find that the spatially-shared and input-independent properties of ReLU activation make CNNs less robust to white-box adversarial attacks with either standard or adversarial training. To address this problem, we extend ReLU to a novel Sparta activation function (Spatially attentive and Adversarially Robust Activation), which enables CNNs to achieve both higher robustness, i.e., lower error rate on adversarial examples, and higher accuracy, i.e., lower error rate on clean examples, than the existing state-of-the-art (SOTA)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · COVID-19 diagnosis using AI
MethodsConvolution
