Where AI Assurance Might Go Wrong: Initial lessons from engineering of critical systems
Robin Bloomfield, John Rushby

TL;DR
This paper examines how traditional safety engineering principles from critical systems can inform AI safety assurance, highlighting challenges, gaps, and the need for more rigorous, scalable assurance methods tailored to AI systems.
Contribution
It analyzes critical systems safety practices and discusses their applicability to AI safety, emphasizing the importance of broad system boundaries, risk elaboration, and assurance theories.
Findings
Critical system safety practices can inform AI assurance frameworks.
Current assurance methods lack scalability and theoretical foundations for AI.
System boundaries and risk tolerability need broader elaboration.
Abstract
We draw on our experience working on system and software assurance and evaluation for systems important to society to summarise how safety engineering is performed in traditional critical systems, such as aircraft flight control. We analyse how this critical systems perspective might support the development and implementation of AI Safety Frameworks. We present the analysis in terms of: system engineering, safety and risk analysis, and decision analysis and support. We consider four key questions: What is the system? How good does it have to be? What is the impact of criticality on system development? and How much should we trust it? We identify topics worthy of further discussion. In particular, we are concerned that system boundaries are not broad enough, that the tolerability and nature of the risks are not sufficiently elaborated, and that the assurance methods lack theories that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSafety Systems Engineering in Autonomy · Risk and Safety Analysis · Adversarial Robustness in Machine Learning
