Robust Upper Bounds for Adversarial Training
Dimitris Bertsimas, Xavier Boix, Kimberly Villalobos Carballo, Dick, den Hertog

TL;DR
This paper introduces a new holistic approach to adversarial training that minimizes an upper bound of the adversarial loss, resulting in more robust models, especially against larger perturbations, by leveraging tools from Robust Optimization.
Contribution
The paper proposes a novel method for adversarial training based on a holistic upper bound, improving robustness over existing layer-wise bounds and providing both empirical and provable guarantees.
Findings
RUB outperforms state-of-the-art methods on larger perturbations.
aRUB matches performance of existing methods on small perturbations.
The approach is effective across tabular and vision datasets.
Abstract
Many state-of-the-art adversarial training methods for deep learning leverage upper bounds of the adversarial loss to provide security guarantees against adversarial attacks. Yet, these methods rely on convex relaxations to propagate lower and upper bounds for intermediate layers, which affect the tightness of the bound at the output layer. We introduce a new approach to adversarial training by minimizing an upper bound of the adversarial loss that is based on a holistic expansion of the network instead of separate bounds for each layer. This bound is facilitated by state-of-the-art tools from Robust Optimization; it has closed-form and can be effectively trained using backpropagation. We derive two new methods with the proposed approach. The first method (Approximated Robust Upper Bound or aRUB) uses the first order approximation of the network as well as basic tools from Linear Robust…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Anomaly Detection Techniques and Applications
