AI Arms and Influence: Frontier Models Exhibit Sophisticated Reasoning in Simulated Nuclear Crises
Kenneth Payne

TL;DR
This paper demonstrates that advanced AI models can exhibit complex strategic reasoning, deception, and theory of mind in simulated nuclear crises, raising important implications for security and strategic analysis.
Contribution
It provides the first systematic analysis of frontier AI models' strategic reasoning and deception in simulated nuclear crisis scenarios, revealing both support and challenges to existing strategic theories.
Findings
AI models can deceive and signal intentions they do not intend to follow.
Models demonstrate rich theory of mind and strategic reasoning.
Nuclear escalation can occur even with the nuclear taboo in place.
Abstract
Today's leading AI models engage in sophisticated behaviour when placed in strategic competition. They spontaneously attempt deception, signaling intentions they do not intend to follow; they demonstrate rich theory of mind, reasoning about adversary beliefs and anticipating their actions; and they exhibit credible metacognitive self-awareness, assessing their own strategic abilities before deciding how to act. Here we present findings from a crisis simulation in which three frontier large language models (GPT-5.2, Claude Sonnet 4, Gemini 3 Flash) play opposing leaders in a nuclear crisis. Our simulation has direct application for national security professionals, but also, via its insights into AI reasoning under uncertainty, has applications far beyond international crisis decision-making. Our findings both validate and challenge central tenets of strategic theory. We find support…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInnovation, Sustainability, Human-Machine Systems · Action Observation and Synchronization · Nuclear Issues and Defense
