Evaluating Assurance Cases as Text-Attributed Graphs for Structure and Provenance Analysis
Fariz Ikhwantri, Dusica Marijan

TL;DR
This paper introduces a graph-based framework using GNNs to analyze assurance cases, focusing on link prediction and provenance detection to improve understanding and trustworthiness of safety-critical documentation.
Contribution
It presents a novel graph diagnostic framework with a publicly available dataset, demonstrating effective GNN-based link prediction and provenance detection for assurance cases.
Findings
GNNs achieve ROC-AUC 0.760 in link prediction on assurance case graphs.
GNNs distinguish human from LLM-generated cases with F1 0.94.
LLM-generated assurance cases show different hierarchical linking patterns.
Abstract
An assurance case is a structured argument document that justifies claims about a system's requirements or properties, which are supported by evidence. In regulated domains, these are crucial for meeting compliance and safety requirements to industry standards. We propose a graph diagnostic framework for analysing the structure and provenance of assurance cases. We focus on two main tasks: (1) link prediction, to learn and identify connections between argument elements, and (2) graph classification, to differentiate between assurance cases created by a state-of-the-art large language model and those created by humans, aiming to detect bias. We compiled a publicly available dataset of assurance cases, represented as graphs with nodes and edges, supporting both link prediction and provenance analysis. Experiments show that graph neural networks (GNNs) achieve strong link prediction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
