Scaling provable adversarial defenses

Eric Wong; Frank R. Schmidt; Jan Hendrik Metzen; J. Zico Kolter

arXiv:1805.12514·cs.LG·November 26, 2018·120 cites

Scaling provable adversarial defenses

Eric Wong, Frank R. Schmidt, Jan Hendrik Metzen, J. Zico Kolter

PDF

Open Access 4 Repos

TL;DR

This paper advances scalable provable adversarial defenses for larger neural networks by extending training methods to complex architectures, introducing efficient nonlinear random projections, and employing cascade models, achieving state-of-the-art robustness on MNIST and CIFAR.

Contribution

It introduces a modular approach for training robust networks with skip connections, a linear-scaling nonlinear random projection method for ReLU networks, and a cascade model technique to improve robustness.

Findings

01

Achieved 3.1% robust error on MNIST with $\, ext{l}_ ext{infty}$ perturbations.

02

Reduced robust error on CIFAR from 80% to 36.4%.

03

Extended provable defenses to larger, more complex network architectures.

Abstract

Recent work has developed methods for learning deep network classifiers that are provably robust to norm-bounded adversarial perturbation; however, these methods are currently only possible for relatively small feedforward networks. In this paper, in an effort to scale these approaches to substantially larger models, we extend previous work in three main directions. First, we present a technique for extending these training procedures to much more general networks, with skip connections (such as ResNets) and general nonlinearities; the approach is fully modular, and can be implemented automatically (analogous to automatic differentiation). Second, in the specific case of $ℓ_{\infty}$ adversarial perturbations and networks with ReLU nonlinearities, we adopt a nonlinear random projection for training, which scales linearly in the number of hidden units (previous approaches scaled…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Integrated Circuits and Semiconductor Failure Analysis

Methods*Communicated@Fast*How Do I Communicate to Expedia?