Adversarial flows: A gradient flow characterization of adversarial attacks

Lukas Weigand; Tim Roith; Martin Burger

arXiv:2406.05376·cs.LG·September 17, 2025

Adversarial flows: A gradient flow characterization of adversarial attacks

Lukas Weigand, Tim Roith, Martin Burger

PDF

1 Repo

TL;DR

This paper provides a mathematical framework interpreting adversarial attack methods as gradient flows, demonstrating convergence and characterizing the inner optimization in adversarial training through differential inclusions and Wasserstein gradient flows.

Contribution

It introduces a novel interpretation of adversarial attacks as gradient flows and establishes convergence results, connecting discrete attacks to continuous differential inclusions.

Findings

01

Fast gradient sign method as Euler discretization of gradient flow

02

Convergence of normalized gradient descent methods to the flow

03

Characterization of adversarial training inner optimization via differential inclusions

Abstract

A popular method to perform adversarial attacks on neuronal networks is the so-called fast gradient sign method and its iterative variant. In this paper, we interpret this method as an explicit Euler discretization of a differential inclusion, where we also show convergence of the discretization to the associated gradient flow. To do so, we consider the concept of p-curves of maximal slope in the case $p = \infty$ . We prove existence of $\infty$ -curves of maximum slope and derive an alternative characterization via differential inclusions. Furthermore, we also consider Wasserstein gradient flows for potential energies, where we show that curves in the Wasserstein space can be characterized by a representing measure on the space of curves in the underlying Banach space, which fulfill the differential inclusion. The application of our theory to the finite-dimensional setting is twofold: On…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

timroith/adversarialflows
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.