Toward Risk Thresholds for AI-Enabled Cyber Threats: Enhancing Decision-Making Under Uncertainty with Bayesian Networks
Krystal Jackson, Deepika Raman, Jessica Newman, Nada Madkour, Charlotte Yuan, and Evan R. Murphy

TL;DR
This paper introduces a probabilistic, evidence-based framework using Bayesian networks to better determine and evaluate risk thresholds for AI-enabled cyber threats, improving decision-making under uncertainty.
Contribution
It proposes a structured Bayesian network approach for modeling AI cyber risks, addressing limitations of current threshold methods, and demonstrates its application through a case study on AI-augmented phishing.
Findings
Bayesian networks enable integration of diverse evidence and uncertainty.
Analysis of industry thresholds reveals common elements and gaps.
Case study shows improved risk estimation for AI-driven phishing.
Abstract
Artificial intelligence (AI) is increasingly being used to augment and automate cyber operations, altering the scale, speed, and accessibility of malicious activity. These shifts raise urgent questions about when AI systems introduce unacceptable or intolerable cyber risk, and how risk thresholds should be identified before harms materialize at scale. In recent years, industry, government, and civil society actors have begun to articulate such thresholds for advanced AI systems, with the goal of signaling when models meaningfully amplify cyber threats, for example, by automating multi-stage intrusions, enabling zero-day discovery, or lowering the expertise required for sophisticated attacks. However, current approaches to determine these thresholds remain fragmented and limited. Many thresholds rely solely on capability benchmarks or narrow threat scenarios, and are weakly connected to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInformation and Cyber Security · Adversarial Robustness in Machine Learning · Network Security and Intrusion Detection
