Understanding and Increasing Efficiency of Frank-Wolfe Adversarial Training
Theodoros Tsiligkaridis, Jay Roberts

TL;DR
This paper introduces a theoretical framework for adversarial training using Frank-Wolfe optimization, revealing geometric insights and developing an efficient adaptive training algorithm that balances robustness and training time.
Contribution
It provides a geometric connection between loss landscape and attack distortion, and proposes FW-AT-Adapt, a novel adaptive adversarial training method based on Frank-Wolfe optimization.
Findings
FW attacks achieve near maximal distortion on robust models.
Catastrophic overfitting correlates with low FW attack distortion.
FW-AT-Adapt reduces training time while maintaining robustness.
Abstract
Deep neural networks are easily fooled by small perturbations known as adversarial attacks. Adversarial Training (AT) is a technique that approximately solves a robust optimization problem to minimize the worst-case loss and is widely regarded as the most effective defense. Due to the high computation time for generating strong adversarial examples in the AT process, single-step approaches have been proposed to reduce training time. However, these methods suffer from catastrophic overfitting where adversarial accuracy drops during training, and although improvements have been proposed, they increase training time and robustness is far from that of multi-step AT. We develop a theoretical framework for adversarial training with FW optimization (FW-AT) that reveals a geometric connection between the loss landscape and the distortion of FW attacks. We analytically…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Anomaly Detection Techniques and Applications
