Bounding the Black Box: A Statistical Certification Framework for AI Risk Regulation
Natan Levy, Gadi Perl

TL;DR
This paper introduces a statistical certification framework inspired by aviation safety standards to quantitatively verify AI system safety for regulatory compliance.
Contribution
It proposes a two-stage process that enables regulators and developers to certify AI safety without needing internal model access, addressing a critical verification gap.
Findings
The framework provides an auditable upper bound on AI failure rates.
It aligns with existing legal and regulatory safety requirements.
The method scales to complex, opaque AI architectures.
Abstract
Artificial intelligence now decides who receives a loan, who is flagged for criminal investigation, and whether an autonomous vehicle brakes in time. Governments have responded: the EU AI Act, the NIST Risk Management Framework, and the Council of Europe Convention all demand that high-risk systems demonstrate safety before deployment. Yet beneath this regulatory consensus lies a critical vacuum: none specifies what ``acceptable risk'' means in quantitative terms, and none provides a technical method for verifying that a deployed system actually meets such a threshold. The regulatory architecture is in place; the verification instrument is not. This gap is not theoretical. As the EU AI Act moves into full enforcement, developers face mandatory conformity assessments without established methodologies for producing quantitative safety evidence - and the systems most in need of oversight…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
