Scaling provable adversarial defenses
Eric Wong, Frank R. Schmidt, Jan Hendrik Metzen, J. Zico Kolter

TL;DR
This paper advances scalable provable adversarial defenses for larger neural networks by extending training methods to complex architectures, introducing efficient nonlinear random projections, and employing cascade models, achieving state-of-the-art robustness on MNIST and CIFAR.
Contribution
It introduces a modular approach for training robust networks with skip connections, a linear-scaling nonlinear random projection method for ReLU networks, and a cascade model technique to improve robustness.
Findings
Achieved 3.1% robust error on MNIST with $\, ext{l}_ ext{infty}$ perturbations.
Reduced robust error on CIFAR from 80% to 36.4%.
Extended provable defenses to larger, more complex network architectures.
Abstract
Recent work has developed methods for learning deep network classifiers that are provably robust to norm-bounded adversarial perturbation; however, these methods are currently only possible for relatively small feedforward networks. In this paper, in an effort to scale these approaches to substantially larger models, we extend previous work in three main directions. First, we present a technique for extending these training procedures to much more general networks, with skip connections (such as ResNets) and general nonlinearities; the approach is fully modular, and can be implemented automatically (analogous to automatic differentiation). Second, in the specific case of adversarial perturbations and networks with ReLU nonlinearities, we adopt a nonlinear random projection for training, which scales linearly in the number of hidden units (previous approaches scaled…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Integrated Circuits and Semiconductor Failure Analysis
Methods*Communicated@Fast*How Do I Communicate to Expedia?
