Language Statistics and False Belief Reasoning: Evidence from 41 Open-Weight LMs
Sean Trott, Samuel Taylor, Cameron Jones, James A. Michaelov, Pamela D. Rivi\`ere

TL;DR
This study evaluates 41 open-weight language models on false belief reasoning tasks, revealing that larger models show greater sensitivity and that language distributional statistics may partly explain human-like attribution biases.
Contribution
It extends prior work by assessing a large, diverse set of open-weight LMs on false belief tasks, providing new insights into model sensitivity and human cognition hypotheses.
Findings
34% of models show sensitivity to implied knowledge states
Larger models exhibit higher sensitivity and predictive power
Humans and LMs both attribute false beliefs more when cued with non-factive verbs
Abstract
Research on mental state reasoning in language models (LMs) has the potential to inform theories of human social cognition--such as the theory that mental state reasoning emerges in part from language exposure--and our understanding of LMs themselves. Yet much published work on LMs relies on a relatively small sample of closed-source LMs, limiting our ability to rigorously test psychological theories and evaluate LM capacities. Here, we replicate and extend published work on the false belief task by assessing LM mental state reasoning behavior across 41 open-weight models (from distinct model families). We find sensitivity to implied knowledge states in 34% of the LMs tested; however, consistent with prior work, none fully ``explain away'' the effect in humans. Larger LMs show increased sensitivity and also exhibit higher psychometric predictive power. Finally, we use LM behavior to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeurobiology of Language and Bilingualism · Language and cultural evolution · Embodied and Extended Cognition
