On the Effectiveness of Interval Bound Propagation for Training Verifiably Robust Models
Sven Gowal, Krishnamurthy Dvijotham, Robert Stanforth, Rudy Bunel,, Chongli Qin, Jonathan Uesato, Relja Arandjelovic, Timothy Mann, Pushmeet, Kohli

TL;DR
This paper demonstrates that a simple interval bound propagation technique can effectively train large, provably robust neural networks, outperforming complex methods and achieving state-of-the-art verified accuracy on multiple datasets.
Contribution
The authors show how IBP, a straightforward bounding method, can be used with specific loss functions and hyper-parameters to train large robust models efficiently.
Findings
IBP-based training outperforms more complex methods.
Achieved state-of-the-art verified accuracy on MNIST, CIFAR-10, and SVHN.
Trained the largest verified model on a downscaled ImageNet.
Abstract
Recent work has shown that it is possible to train deep neural networks that are provably robust to norm-bounded adversarial perturbations. Most of these methods are based on minimizing an upper bound on the worst-case loss over all possible adversarial perturbations. While these techniques show promise, they often result in difficult optimization procedures that remain hard to scale to larger networks. Through a comprehensive analysis, we show how a simple bounding technique, interval bound propagation (IBP), can be exploited to train large provably robust neural networks that beat the state-of-the-art in verified accuracy. While the upper bound computed by IBP can be quite weak for general networks, we demonstrate that an appropriate loss and clever hyper-parameter schedule allow the network to adapt such that the IBP bound is tight. This results in a fast and stable learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Machine Learning and Algorithms · Fault Detection and Control Systems
