Low Curvature Activations Reduce Overfitting in Adversarial Training
Vasu Singla, Sahil Singla, David Jacobs, Soheil Feizi

TL;DR
This paper demonstrates that using activation functions with low curvature can significantly reduce overfitting and the generalization gap in adversarial training, improving robustness and preventing double descent.
Contribution
It reveals the impact of low-curvature activation functions on reducing overfitting and generalization gaps in adversarial training, including non-smooth activations like LeakyReLU.
Findings
Low-curvature activations reduce standard and robust generalization gaps.
Activation functions with low curvature prevent the double descent phenomenon.
Both smooth and non-smooth low-curvature activations have regularization effects.
Abstract
Adversarial training is one of the most effective defenses against adversarial attacks. Previous works suggest that overfitting is a dominant phenomenon in adversarial training leading to a large generalization gap between test and train accuracy in neural networks. In this work, we show that the observed generalization gap is closely related to the choice of the activation function. In particular, we show that using activation functions with low (exact or approximate) curvature values has a regularization effect that significantly reduces both the standard and robust generalization gaps in adversarial training. We observe this effect for both differentiable/smooth activations such as SiLU as well as non-differentiable/non-smooth activations such as LeakyReLU. In the latter case, the "approximate" curvature of the activation is low. Finally, we show that for activation functions with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOrthopedic Surgery and Rehabilitation · Spine and Intervertebral Disc Pathology
MethodsSigmoid Linear Unit · Sigmoid Activation · (FiLe@Against@Claim)How do I file a claim against Expedia?
