On the Extreme Variance of Certified Local Robustness Across Model Seeds

Minh Le; Phuong Cao

arXiv:2601.13303·cs.LG·April 6, 2026

On the Extreme Variance of Certified Local Robustness Across Model Seeds

Minh Le, Phuong Cao

PDF

TL;DR

This paper reveals that the certified robustness of neural networks varies extremely across different training seeds and datasets, raising concerns about the reliability of current robustness verification methods.

Contribution

It demonstrates the significant variance in certified robustness due to random seed choices and dataset differences, highlighting the need for more reliable verification practices.

Findings

01

Certified robustness variance exceeds recent robustness improvements.

02

Robustness generalization to unseen data is highly inconsistent.

03

Results challenge the dependability of current robustness verification methods.

Abstract

Robustness verification of neural networks, referring to formally proving that neural networks satisfy robustness properties, is of crucial importance in safety-critical applications, where model failures can result in loss of human life or million-dollar damages. However, the dependability of verification results may be questioned due to sources of randomness in machine learning, and although this has been widely investigated for accuracy, its impact on robustness verification remains unknown. In this paper, we demonstrate a concerning result: Models that differ only in random seeds during training exhibit extreme variance in their certified robustness, with a standard deviation that is statistically larger than the marginal robustness improvements reported in recent machine learning papers. In addition, we also show that certified robustness generalization to unseen data varies…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.