Shedding More Light on Robust Classifiers under the lens of Energy-based Models
Mujtaba Hussain Mirza, Maria Rosaria Briglia, Senad Beadini, Iacopo, Masi

TL;DR
This paper reinterprets robust classifiers as Energy-based Models to analyze adversarial training dynamics, revealing new insights and proposing a novel weighted training scheme that improves robustness and generative capabilities.
Contribution
It offers a new energy landscape perspective on adversarial training, introduces WEAT for improved robustness, and demonstrates enhanced generative performance of classifiers.
Findings
Adversarial attacks produce in-distribution adversarial images with lower energy.
TRADES implicitly aligns natural and adversarial energies to prevent overfitting.
State-of-the-art robust classifiers smooth the energy landscape, improving robustness and generative quality.
Abstract
By reinterpreting a robust discriminative classifier as Energy-based Model (EBM), we offer a new take on the dynamics of adversarial training (AT). Our analysis of the energy landscape during AT reveals that untargeted attacks generate adversarial images much more in-distribution (lower energy) than the original data from the point of view of the model. Conversely, we observe the opposite for targeted attacks. On the ground of our thorough analysis, we present new theoretical and practical results that show how interpreting AT energy dynamics unlocks a better understanding: (1) AT dynamic is governed by three phases and robust overfitting occurs in the third phase with a drastic divergence between natural and adversarial energies (2) by rewriting the loss of TRadeoff-inspired Adversarial DEfense via Surrogate-loss minimization (TRADES) in terms of energies, we show that TRADES…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
