A Structured Approach to Safety Case Construction for AI Systems
Sung Une Lee, Liming Zhu, Md Shamsujjoha, Liming Dong, Qinghua Lu, Jieshan Chen, Lionel Briand

TL;DR
This paper presents a systematic, adaptable framework for constructing safety cases for AI systems, addressing their unique challenges like unpredictability and evolving behaviors, to improve safety governance.
Contribution
It introduces comprehensive taxonomies and a reusable safety-case template specifically designed for AI systems, overcoming limitations of traditional safety practices.
Findings
Developed AI-specific claim and argument taxonomies
Created a reusable safety-case template for AI systems
Demonstrated patterns for addressing evaluation and risk in AI safety
Abstract
Safety cases, structured arguments that a system is acceptably safe, are becoming central to the governance of AI systems. Yet, traditional safety-case practices from aviation or nuclear engineering rely on well-specified system boundaries, stable architectures, and known failure modes. Modern AI systems, such as generative and agentic AI, are the opposite. Their capabilities emerge unpredictably from low-level training objectives, their behaviour varies with prompts, and their risk profiles shift through fine-tuning, scaffolding, or deployment context. This study examines how safety cases are currently constructed for AI systems and why classical approaches fail to capture these dynamics. This study introduces comprehensive taxonomies for AI-specific claim types (assertion-based, constraint-based, capability-based), argument types (demonstrative, comparative, causal/explanatory,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSafety Systems Engineering in Autonomy · Ethics and Social Impacts of AI · Adversarial Robustness in Machine Learning
