Boosting Adversarial Robustness From The Perspective of Effective Margin Regularization
Ziquan Liu, Antoni B. Chan

TL;DR
This paper introduces an effective margin regularization (EMR) technique that enhances adversarial robustness of deep neural networks by controlling weight norms, outperforming existing methods on large-scale models and when combined with other defenses.
Contribution
The paper proposes a novel effective margin regularization method that improves adversarial robustness by regularizing weight norms during training, addressing the scale-variant property of cross-entropy loss.
Findings
EMR learns larger effective margins and boosts robustness.
EMR outperforms basic adversarial training, TRADES, and regularization baselines.
Combining EMR with strong defenses further enhances robustness.
Abstract
The adversarial vulnerability of deep neural networks (DNNs) has been actively investigated in the past several years. This paper investigates the scale-variant property of cross-entropy loss, which is the most commonly used loss function in classification tasks, and its impact on the effective margin and adversarial robustness of deep neural networks. Since the loss function is not invariant to logit scaling, increasing the effective weight norm will make the loss approach zero and its gradient vanish while the effective margin is not adequately maximized. On typical DNNs, we demonstrate that, if not properly regularized, the standard training does not learn large effective margins and leads to adversarial vulnerability. To maximize the effective margins and learn a robust DNN, we propose to regularize the effective weight norm during training. Our empirical study on feedforward DNNs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
