Fixed Inter-Neuron Covariability Induces Adversarial Robustness
Muhammad Ahmed Shah, Bhiksha Raj

TL;DR
This paper introduces the SCA layer, which enforces fixed covariability among neuron activations, leading to improved adversarial robustness in neural networks without adversarial training.
Contribution
The paper proposes the Self-Consistent Activation (SCA) layer that constrains neuron covariability, enhancing adversarial robustness in DNNs beyond existing methods.
Findings
Models with SCA layers outperform standard models against Auto-PGD attacks.
SCA layers improve robustness without adversarial training.
The approach is effective on image and sound recognition tasks.
Abstract
The vulnerability to adversarial perturbations is a major flaw of Deep Neural Networks (DNNs) that raises question about their reliability when in real-world scenarios. On the other hand, human perception, which DNNs are supposed to emulate, is highly robust to such perturbations, indicating that there may be certain features of the human perception that make it robust but are not represented in the current class of DNNs. One such feature is that the activity of biological neurons is correlated and the structure of this correlation tends to be rather rigid over long spans of times, even if it hampers performance and learning. We hypothesize that integrating such constraints on the activations of a DNN would improve its adversarial robustness, and, to test this hypothesis, we have developed the Self-Consistent Activation (SCA) layer, which comprises of neurons whose activations are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Integrated Circuits and Semiconductor Failure Analysis · Machine Learning in Materials Science
MethodsSemantic Cross Attention
