It's Simplex! Disaggregating Measures to Improve Certified Robustness
Andrew C. Cullen, Paul Montague, Shijie Liu, Sarah M. Erfani, and Benjamin I.P. Rubinstein

TL;DR
This paper introduces disaggregated measures to better evaluate and enhance certified robustness, leading to methods that certify significantly more samples and potentially double the certification radius.
Contribution
It proposes two novel approaches for analyzing certification performance at the sample level, enabling more accurate assessment and improved certification methods.
Findings
Certifies 9% more samples at noise scale σ=1.
Potential to more than double the certification radius.
Improved certification performance on complex tasks.
Abstract
Certified robustness circumvents the fragility of defences against adversarial attacks, by endowing model predictions with guarantees of class invariance for attacks up to a calculated size. While there is value in these certifications, the techniques through which we assess their performance do not present a proper accounting of their strengths and weaknesses, as their analysis has eschewed consideration of performance over individual samples in favour of aggregated measures. By considering the potential output space of certified models, this work presents two distinct approaches to improve the analysis of certification mechanisms, that allow for both dataset-independent and dataset-dependent measures of certification performance. Embracing such a perspective uncovers new certification approaches, which have the potential to more than double the achievable radius of certification,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications
