Readable Minds: Emergent Theory-of-Mind-Like Behavior in LLM Poker Agents
Hsieh-Ting Lin, Tsung-Yu Hou

TL;DR
This study shows that large language models can develop Theory of Mind-like reasoning through extended interactive poker play, especially when equipped with persistent memory, leading to sophisticated opponent modeling and deception.
Contribution
It demonstrates that ToM-like behavior can emerge dynamically in LLMs via interaction and memory, without explicit training or prompts, advancing understanding of AI social cognition.
Findings
Memory is necessary and sufficient for ToM emergence in LLM agents.
Agents with memory develop recursive opponent models and strategic deception.
ToM-like reasoning correlates with memory presence, not domain knowledge.
Abstract
Theory of Mind (ToM) -- the ability to model others' mental states -- is fundamental to human social cognition. Whether large language models (LLMs) can develop ToM has been tested exclusively through static vignettes, leaving open whether ToM-like reasoning can emerge through dynamic interaction. Here we report that autonomous LLM agents playing extended sessions of Texas Hold'em poker progressively develop sophisticated opponent models, but only when equipped with persistent memory. In a 2x2 factorial design crossing memory (present/absent) with domain knowledge (present/absent), each with five replications (N = 20 experiments, ~6,000 agent-hand observations), we find that memory is both necessary and sufficient for ToM-like behavior emergence (Cliff's delta = 1.0, p = 0.008). Agents with memory reach ToM Level 3-5 (predictive to recursive modeling), while agents without memory remain…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
