Adversarial Robustness Against the Union of Multiple Perturbation Models
Pratyush Maini, Eric Wong, J. Zico Kolter

TL;DR
This paper introduces a generalized attack method that optimizes robustness against multiple adversarial perturbation models simultaneously, leading to improved defense performance on standard datasets.
Contribution
It proposes a new PGD-based approach that considers the worst-case over multiple perturbation models, enabling training of models robust against several attack types at once.
Findings
Achieves 47.0% adversarial accuracy on CIFAR10 against combined perturbations.
Outperforms previous methods with 40.6% accuracy on the union of attack models.
Effectively balances robustness across different perturbation types.
Abstract
Owing to the susceptibility of deep learning systems to adversarial attacks, there has been a great deal of work in developing (both empirically and certifiably) robust classifiers. While most work has defended against a single type of attack, recent work has looked at defending against multiple perturbation models using simple aggregations of multiple attacks. However, these methods can be difficult to tune, and can easily result in imbalanced degrees of robustness to individual perturbation models, resulting in a sub-optimal worst-case loss over the union. In this work, we develop a natural generalization of the standard PGD-based procedure to incorporate multiple perturbation models into a single attack, by taking the worst-case over all steepest descent directions. This approach has the advantage of directly converging upon a trade-off between different perturbation models which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Bacillus and Francisella bacterial research
