Designing escalation criteria for international AI incident response: criteria, triggers, and thresholds
Francesca Gomez, Matthew Ball, Michael Harre, Lydia Preston, Josephine Schwab, Caio Machado

TL;DR
This paper develops an operational escalation framework for AI incidents to facilitate international coordination, analyzing criteria, triggers, and thresholds based on regulatory and incident data.
Contribution
It introduces a set of eight criteria for escalation decisions, maps them to existing regulations, and tests the framework against real incidents to identify detection gaps.
Findings
Under-detection occurs when escalation requires confirmed harm.
Systemic risks from accumulated incidents are often under-detected.
Thresholds based on legal terms can be impractical under time pressure.
Abstract
AI incident reporting requirements are emerging in regulation and policy, yet no operational criteria exist for determining when a detected AI incident warrants escalation beyond national handling to international coordination. This paper proposes an escalation framework to address this gap, intended as a common reference point across jurisdictions that enables aligned escalation while preserving flexibility in how actors respond within their own legal and policy contexts. We review SB 53, the EU AI Act, the GPAI Code of Practice, and incident frameworks from other industries to derive eight criteria for assessing whether an incident warrants escalation, translated into a sequential flowchart with gated decision points and threshold checks. For each criterion, we map how it interplays with these regulatory frameworks, identifying where their design choices support or undermine effective…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
