Does simple trump complex? Comparing strategies for adversarial robustness in DNNs
William Brooks, Marelie H. Davel, Coenraad Mouton

TL;DR
This paper compares simple and complex margin-based adversarial training methods for DNNs, analyzing their components to determine which most effectively improve robustness against attacks like AutoAttack and PGD.
Contribution
It systematically isolates and evaluates components of two margin-based adversarial training methods to identify key factors enhancing robustness.
Findings
Simple margin maximization significantly improves robustness.
Complex methods offer marginal gains over simpler approaches.
Certain components contribute more to adversarial resilience.
Abstract
Deep Neural Networks (DNNs) have shown substantial success in various applications but remain vulnerable to adversarial attacks. This study aims to identify and isolate the components of two different adversarial training techniques that contribute most to increased adversarial robustness, particularly through the lens of margins in the input space -- the minimal distance between data points and decision boundaries. Specifically, we compare two methods that maximize margins: a simple approach which modifies the loss function to increase an approximation of the margin, and a more complex state-of-the-art method (Dynamics-Aware Robust Training) which builds upon this approach. Using a VGG-16 model as our base, we systematically isolate and evaluate individual components from these methods to determine their relative impact on adversarial robustness. We assess the effect of each component…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
