Do Wider Neural Networks Really Help Adversarial Robustness?
Boxi Wu, Jinghui Chen, Deng Cai, Xiaofei He, Quanquan Gu

TL;DR
This paper investigates how neural network width influences adversarial robustness, revealing that wider networks may require adjusted regularization parameters to achieve optimal robustness, and introduces a method to do so.
Contribution
The paper provides a theoretical and empirical analysis of the impact of network width on adversarial robustness and proposes a width-adjusted regularization method to improve robustness.
Findings
Wider networks tend to have worse perturbation stability at fixed regularization.
Properly enlarging the regularization parameter $\lambda$ enhances robustness of wider models.
The proposed WAR method adaptively adjusts $\lambda$ and improves robustness while reducing tuning time.
Abstract
Adversarial training is a powerful type of defense against adversarial examples. Previous empirical results suggest that adversarial training requires wider networks for better performances. However, it remains elusive how neural network width affects model robustness. In this paper, we carefully examine the relationship between network width and model robustness. Specifically, we show that the model robustness is closely related to the tradeoff between natural accuracy and perturbation stability, which is controlled by the robust regularization parameter . With the same , wider networks can achieve better natural accuracy but worse perturbation stability, leading to a potentially worse overall model robustness. To understand the origin of this phenomenon, we further relate the perturbation stability with the network's local Lipschitzness. By leveraging recent results…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Bacillus and Francisella bacterial research · Anomaly Detection Techniques and Applications
