Tight Certification of Adversarially Trained Neural Networks via Nonconvex Low-Rank Semidefinite Relaxations
Hong-Ming Chiu, Richard Y. Zhang

TL;DR
This paper introduces a nonconvex low-rank semidefinite relaxation method for certifying the robustness of adversarially trained neural networks, achieving high-quality certifications efficiently.
Contribution
It proposes a novel nonconvex certification approach that rivals SDP methods in quality while being computationally more efficient, overcoming existing LP relaxation barriers.
Findings
Almost closes the gap to exact certification
Achieves high-quality robustness certificates
Operates efficiently with polynomial-time algorithms
Abstract
Adversarial training is well-known to produce high-quality neural network models that are empirically robust against adversarial perturbations. Nevertheless, once a model has been adversarially trained, one often desires a certification that the model is truly robust against all future attacks. Unfortunately, when faced with adversarially trained models, all existing approaches have significant trouble making certifications that are strong enough to be practically useful. Linear programming (LP) techniques in particular face a "convex relaxation barrier" that prevent them from making high-quality certifications, even after refinement with mixed-integer linear programming (MILP) and branch-and-bound (BnB) techniques. In this paper, we propose a nonconvex certification technique, based on a low-rank restriction of a semidefinite programming (SDP) relaxation. The nonconvex relaxation makes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
