Quantifying Automation Risk in High-Automation AI Systems: A Bayesian Framework for Failure Propagation and Optimal Oversight

Vishal Srivastava; Tanmay Sah

arXiv:2602.18986·cs.AI·February 24, 2026

Quantifying Automation Risk in High-Automation AI Systems: A Bayesian Framework for Failure Propagation and Optimal Oversight

Vishal Srivastava, Tanmay Sah

PDF

Open Access

TL;DR

This paper introduces a Bayesian framework to quantify and analyze the risks of failure propagation in high-automation AI systems, providing theoretical tools for better risk management and oversight.

Contribution

It develops a formal Bayesian risk decomposition and theoretical foundations for assessing failure propagation and optimal oversight in automated AI systems.

Findings

01

Formal proof of the risk decomposition

02

Harm propagation equivalence theorem

03

Guidelines for optimal resource allocation

Abstract

Organizations across finance, healthcare, transportation, content moderation, and critical infrastructure are rapidly deploying highly automated AI systems, yet they lack principled methods to quantify how increasing automation amplifies harm when failures occur. We propose a parsimonious Bayesian risk decomposition expressing expected loss as the product of three terms: the probability of system failure, the conditional probability that a failure propagates into harm given the automation level, and the expected severity of harm. This framework isolates a critical quantity -- the conditional probability that failures propagate into harm -- which captures execution and oversight risk rather than model accuracy alone. We develop complete theoretical foundations: formal proofs of the decomposition, a harm propagation equivalence theorem linking the harm propagation probability to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Software System Performance and Reliability · Human-Automation Interaction and Safety