The Loss of Control Playbook: Degrees, Dynamics, and Preparedness
Charlotte Stix, Annika Hallensleben, Alejandro Ortega, Matteo Pistillo

TL;DR
This paper develops a comprehensive taxonomy and preparedness framework for Loss of Control in AI, emphasizing societal vulnerability pathways and actionable governance and technical strategies to mitigate risks.
Contribution
It introduces a graded LoC taxonomy, models societal vulnerability pathways, and proposes an actionable DAP framework for prevention and preparedness.
Findings
Proposes a graded taxonomy based on severity and persistence.
Models societal pathways to AI-induced LoC.
Recommends governance and technical controls for risk mitigation.
Abstract
This research report addresses the absence of an actionable definition for Loss of Control (LoC) in AI systems by developing a novel taxonomy and preparedness framework. Despite increasing policy and research attention, existing LoC definitions vary significantly in scope and timeline, hindering effective LoC assessment and mitigation. To address this issue, we draw from an extensive literature review and propose a graded LoC taxonomy, based on the metrics of severity and persistence, that distinguishes between Deviation, Bounded LoC, and Strict LoC. We model pathways toward a societal state of vulnerability in which sufficiently advanced AI systems have acquired or could acquire the means to cause Bounded or Strict LoC once a catalyst, either misalignment or pure malfunction, materializes. We argue that this state becomes increasingly likely over time, absent strategic intervention,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Human-Automation Interaction and Safety · Ethics and Social Impacts of AI
