Shedding More Light on Robust Classifiers under the lens of Energy-based   Models

Mujtaba Hussain Mirza; Maria Rosaria Briglia; Senad Beadini; Iacopo; Masi

arXiv:2407.06315·cs.CV·September 11, 2024

Shedding More Light on Robust Classifiers under the lens of Energy-based Models

Mujtaba Hussain Mirza, Maria Rosaria Briglia, Senad Beadini, Iacopo, Masi

PDF

Open Access 1 Repo

TL;DR

This paper reinterprets robust classifiers as Energy-based Models to analyze adversarial training dynamics, revealing new insights and proposing a novel weighted training scheme that improves robustness and generative capabilities.

Contribution

It offers a new energy landscape perspective on adversarial training, introduces WEAT for improved robustness, and demonstrates enhanced generative performance of classifiers.

Findings

01

Adversarial attacks produce in-distribution adversarial images with lower energy.

02

TRADES implicitly aligns natural and adversarial energies to prevent overfitting.

03

State-of-the-art robust classifiers smooth the energy landscape, improving robustness and generative quality.

Abstract

By reinterpreting a robust discriminative classifier as Energy-based Model (EBM), we offer a new take on the dynamics of adversarial training (AT). Our analysis of the energy landscape during AT reveals that untargeted attacks generate adversarial images much more in-distribution (lower energy) than the original data from the point of view of the model. Conversely, we observe the opposite for targeted attacks. On the ground of our thorough analysis, we present new theoretical and practical results that show how interpreting AT energy dynamics unlocks a better understanding: (1) AT dynamic is governed by three phases and robust overfitting occurs in the third phase with a drastic divergence between natural and adversarial energies (2) by rewriting the loss of TRadeoff-inspired Adversarial DEfense via Surrogate-loss minimization (TRADES) in terms of energies, we show that TRADES…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

omnai-lab/robust-classifiers-under-the-lens-of-ebm
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications