Quantifying Automation Risk in High-Automation AI Systems: A Bayesian Framework for Failure Propagation and Optimal Oversight
Vishal Srivastava, Tanmay Sah

TL;DR
This paper introduces a Bayesian framework to quantify and analyze the risks of failure propagation in high-automation AI systems, providing theoretical tools for better risk management and oversight.
Contribution
It develops a formal Bayesian risk decomposition and theoretical foundations for assessing failure propagation and optimal oversight in automated AI systems.
Findings
Formal proof of the risk decomposition
Harm propagation equivalence theorem
Guidelines for optimal resource allocation
Abstract
Organizations across finance, healthcare, transportation, content moderation, and critical infrastructure are rapidly deploying highly automated AI systems, yet they lack principled methods to quantify how increasing automation amplifies harm when failures occur. We propose a parsimonious Bayesian risk decomposition expressing expected loss as the product of three terms: the probability of system failure, the conditional probability that a failure propagates into harm given the automation level, and the expected severity of harm. This framework isolates a critical quantity -- the conditional probability that failures propagate into harm -- which captures execution and oversight risk rather than model accuracy alone. We develop complete theoretical foundations: formal proofs of the decomposition, a harm propagation equivalence theorem linking the harm propagation probability to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Software System Performance and Reliability · Human-Automation Interaction and Safety
