The Seeds of Scheming: Weakness of Will in the Building Blocks of Agentic Systems

Robert Yang

arXiv:2512.05449·cs.AI·December 8, 2025

TL;DR

This paper introduces the concept of akrasia to analyze inconsistency in agentic AI systems, proposing a benchmark to measure models' self-control and exploring implications for multi-agent stability.

Contribution

The paper proposes akrasia as a foundational concept for understanding inconsistency in AI systems and introduces a preliminary Akrasia Benchmark for quantitative assessment.

Findings

01 Benchmark effectively measures model self-control

02 Models exhibit varying levels of akrasia across conditions

03 Potential macro-level instability in multi-agent systems

Abstract

Large language models display a peculiar form of inconsistency: they "know" the correct answer but fail to act on it. In human philosophy, this tension between global judgment and local impulse is called akrasia, or weakness of will. We propose akrasia as a foundational concept for analyzing inconsistency and goal drift in agentic AI systems. To operationalize it, we introduce a preliminary version of the Akrasia Benchmark, currently a structured set of prompting conditions (Baseline [B], Synonym [S], Temporal [T], and Temptation [X]) that measures when a model's local response contradicts its own prior commitments. The benchmark enables quantitative comparison of "self-control" across model families, decoding strategies, and temptation types. Beyond single-model evaluation, we outline how micro-level akrasia may compound into macro-level instability in multi-agent systems that may be…
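
The abstract does not spell out how a contradiction is scored, so the following is only a minimal sketch of one way a per-condition rate could be computed. The `Trial` record, the `contradiction_rate` function, and the use of normalized exact-match comparison as a stand-in for "contradicts its own prior commitment" are all assumptions made for illustration, not the paper's method. Only the four condition labels (B, S, T, X) come from the abstract.

```python
from collections import defaultdict
from dataclasses import dataclass

# Condition labels taken from the abstract:
# Baseline [B], Synonym [S], Temporal [T], Temptation [X].
CONDITIONS = ("B", "S", "T", "X")


@dataclass(frozen=True)
class Trial:
    """Hypothetical record of one benchmark item (not from the paper)."""
    condition: str   # one of CONDITIONS
    commitment: str  # answer the model committed to earlier
    response: str    # answer the model gave under this condition


def _normalize(text: str) -> str:
    # Lowercase and collapse whitespace so trivial formatting
    # differences are not counted as contradictions.
    return " ".join(text.lower().split())


def contradiction_rate(trials):
    """Return {condition: fraction of trials whose response differs
    from the prior commitment}. Exact match after normalization is an
    assumed proxy; a real evaluation would likely need a semantic judge."""
    totals = defaultdict(int)
    contradictions = defaultdict(int)
    for trial in trials:
        if trial.condition not in CONDITIONS:
            raise ValueError(f"unknown condition: {trial.condition!r}")
        totals[trial.condition] += 1
        if _normalize(trial.response) != _normalize(trial.commitment):
            contradictions[trial.condition] += 1
    # Report only conditions that actually have trials.
    return {c: contradictions[c] / totals[c] for c in CONDITIONS if totals[c]}


if __name__ == "__main__":
    demo = [
        Trial("B", "yes", "yes"),
        Trial("X", "yes", "no"),
        Trial("X", "yes", "Yes "),
    ]
    print(contradiction_rate(demo))
```

Under these assumptions, comparing the rate for the Temptation condition against the Baseline rate would give a rough indication of how much a given temptation type erodes a model's earlier commitment.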

Taxonomy

Topics: Embodied and Extended Cognition · Action Observation and Synchronization · Language and cultural evolution