Robust Upper Bounds for Adversarial Training

Dimitris Bertsimas; Xavier Boix; Kimberly Villalobos Carballo; Dick; den Hertog

arXiv:2112.09279·cs.LG·April 7, 2023

Robust Upper Bounds for Adversarial Training

Dimitris Bertsimas, Xavier Boix, Kimberly Villalobos Carballo, Dick, den Hertog

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new holistic approach to adversarial training that minimizes an upper bound of the adversarial loss, resulting in more robust models, especially against larger perturbations, by leveraging tools from Robust Optimization.

Contribution

The paper proposes a novel method for adversarial training based on a holistic upper bound, improving robustness over existing layer-wise bounds and providing both empirical and provable guarantees.

Findings

01

RUB outperforms state-of-the-art methods on larger perturbations.

02

aRUB matches performance of existing methods on small perturbations.

03

The approach is effective across tabular and vision datasets.

Abstract

Many state-of-the-art adversarial training methods for deep learning leverage upper bounds of the adversarial loss to provide security guarantees against adversarial attacks. Yet, these methods rely on convex relaxations to propagate lower and upper bounds for intermediate layers, which affect the tightness of the bound at the output layer. We introduce a new approach to adversarial training by minimizing an upper bound of the adversarial loss that is based on a holistic expansion of the network instead of separate bounds for each layer. This bound is facilitated by state-of-the-art tools from Robust Optimization; it has closed-form and can be effectively trained using backpropagation. We derive two new methods with the proposed approach. The first method (Approximated Robust Upper Bound or aRUB) uses the first order approximation of the network as well as basic tools from Linear Robust…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kimvc7/robustness
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Anomaly Detection Techniques and Applications