Dissecting the Ullman Variations with a SCALPEL: Why do LLMs fail at Trivial Alterations to the False Belief Task?

Zhiqiang Pi; Annapurna Vadaparty; Benjamin K. Bergen; and Cameron R. Jones

arXiv:2406.14737·cs.CL·May 29, 2025·1 cites

Dissecting the Ullman Variations with a SCALPEL: Why do LLMs fail at Trivial Alterations to the False Belief Task?

Zhiqiang Pi, Annapurna Vadaparty, Benjamin K. Bergen, and Cameron R. Jones

PDF

Open Access

TL;DR

This paper introduces SCALPEL, a method to analyze why Large Language Models fail at trivial alterations of the False Belief task, revealing they lack certain common-sense inferences crucial for robust Theory of Mind.

Contribution

The paper presents SCALPEL, a novel incremental stimulus modification technique, and applies it to identify specific reasoning failures in LLMs related to common-sense understanding.

Findings

01

LLMs often fail to recognize that seeing a transparent container implies knowing its contents

02

SCALPEL helps identify specific reasoning gaps in LLMs' Theory of Mind capabilities

03

Modern LLMs are not yet robust enough to fully emulate human-like Theory of Mind

Abstract

Recent empirical results have sparked a debate about whether or not Large Language Models (LLMs) are capable of Theory of Mind (ToM). While some have found LLMs to be successful on ToM evaluations such as the False Belief task, others have shown that their performance is not robust against trivial alterations to stimuli. In this paper, we introduce SCALPEL -- a technique to incrementally modify stimuli to test different specific hypotheses about why LLMs fail -- and apply this method to the "transparent-access" modification of the unexpected contents task. Our results suggest that LLMs often do poorly because they fail to make essential common-sense inferences, such as that seeing a transparent container implies recognizing its contents. We conclude that while modern LLMs go beyond mere pattern matching, they still fall short of robust human-like ToM. We argue that SCALPEL can help…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Bayesian Modeling and Causal Inference