Parameterizing Activation Functions for Adversarial Robustness
Sihui Dai, Saeed Mahloujifar, Prateek Mittal

TL;DR
This paper explores how learnable parametric activation functions (PAFs) can enhance adversarial robustness in neural networks, demonstrating that specific activation shape properties and minimal parameter additions can significantly improve robustness.
Contribution
It introduces a new PAF, PSSiLU, and shows how incorporating PAFs with adversarial training improves robustness with minimal additional parameters.
Findings
PSSiLU improves robust accuracy by 4.54% over ReLU on CIFAR-10 with ResNet-18.
Smooth PAFs with few parameters significantly increase robustness.
PSSiLU achieves state-of-the-art robust accuracy on RobustBench.
Abstract
Deep neural networks are known to be vulnerable to adversarially perturbed inputs. A commonly used defense is adversarial training, whose performance is influenced by model capacity. While previous works have studied the impact of varying model width and depth on robustness, the impact of increasing capacity by using learnable parametric activation functions (PAFs) has not been studied. We study how using learnable PAFs can improve robustness in conjunction with adversarial training. We first ask the question: how should we incorporate parameters into activation functions to improve robustness? To address this, we analyze the direct impact of activation shape on robustness through PAFs and observe that activation shapes with positive outputs on negative inputs and with high finite curvature can increase robustness. We combine these properties to create a new PAF, which we call…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
MethodsPart Affinity Fields · Parameterized ReLU
