The Controllability Trap: A Governance Framework for Military AI Agents
Subramanyam Sahoo

TL;DR
This paper introduces the Agentic Military AI Governance Framework (AMAGF), a structured approach to maintaining human control over autonomous military AI systems through real-time metrics and multi-layered governance.
Contribution
It proposes a novel governance architecture with measurable control metrics and concrete mechanisms to address control failures in agentic military AI systems.
Findings
Control Quality Score (CQS) enables real-time control assessment
Framework formalizes responsibilities across five institutional actors
Operational scenario demonstrates practical implementation
Abstract
Agentic AI systems - capable of goal interpretation, world modeling, planning, tool use, long-horizon operation, and autonomous coordination - introduce distinct control failures not addressed by existing safety frameworks. We identify six agentic governance failures tied to these capabilities and show how they erode meaningful human control in military settings. We propose the Agentic Military AI Governance Framework (AMAGF), a measurable architecture structured around three pillars: Preventive Governance (reducing failure likelihood), Detective Governance (real-time detection of control degradation), and Corrective Governance (restoring or safely degrading operations). Its core mechanism, the Control Quality Score (CQS), is a composite real-time metric quantifying human control and enabling graduated responses as control weakens. For each failure type, we define concrete mechanisms,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman-Automation Interaction and Safety · Ethics and Social Impacts of AI · Occupational Health and Safety Research
