Tight Certified Robustness via Min-Max Representations of ReLU Neural Networks
Brendon G. Anderson, Samuel Pfrommer, Somayeh Sojoudi

TL;DR
This paper introduces a convex reformulation for certifying the robustness of ReLU neural networks against convex attack sets, enabling exact solutions for worst-case attacks and improving over previous methods.
Contribution
It develops a novel convex reformulation of the nonconvex certification problem for ReLU networks using measure lifting and distributionally robust optimization, providing tight robustness certificates.
Findings
Exact solutions for worst-case attacks are achievable.
The method outperforms branch-and-bound and relaxation techniques.
Experiments demonstrate improved robustness certification in control and image classification.
Abstract
The reliable deployment of neural networks in control systems requires rigorous robustness guarantees. In this paper, we obtain tight robustness certificates over convex attack sets for min-max representations of ReLU neural networks by developing a convex reformulation of the nonconvex certification problem. This is done by "lifting" the problem to an infinite-dimensional optimization over probability measures, leveraging recent results in distributionally robust optimization to solve for an optimal discrete distribution, and proving that solutions of the original nonconvex problem are generated by the discrete distribution under mild boundedness, nonredundancy, and Slater conditions. As a consequence, optimal (worst-case) attacks against the model may be solved for exactly. This contrasts prior state-of-the-art that either requires expensive branch-and-bound schemes or loose…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
