Dissecting the Ullman Variations with a SCALPEL: Why do LLMs fail at Trivial Alterations to the False Belief Task?
Zhiqiang Pi, Annapurna Vadaparty, Benjamin K. Bergen, and Cameron R. Jones

TL;DR
This paper introduces SCALPEL, a method to analyze why Large Language Models fail at trivial alterations of the False Belief task, revealing they lack certain common-sense inferences crucial for robust Theory of Mind.
Contribution
The paper presents SCALPEL, a novel incremental stimulus modification technique, and applies it to identify specific reasoning failures in LLMs related to common-sense understanding.
Findings
LLMs often fail to recognize that seeing a transparent container implies knowing its contents
SCALPEL helps identify specific reasoning gaps in LLMs' Theory of Mind capabilities
Modern LLMs are not yet robust enough to fully emulate human-like Theory of Mind
Abstract
Recent empirical results have sparked a debate about whether or not Large Language Models (LLMs) are capable of Theory of Mind (ToM). While some have found LLMs to be successful on ToM evaluations such as the False Belief task, others have shown that their performance is not robust against trivial alterations to stimuli. In this paper, we introduce SCALPEL -- a technique to incrementally modify stimuli to test different specific hypotheses about why LLMs fail -- and apply this method to the "transparent-access" modification of the unexpected contents task. Our results suggest that LLMs often do poorly because they fail to make essential common-sense inferences, such as that seeing a transparent container implies recognizing its contents. We conclude that while modern LLMs go beyond mere pattern matching, they still fall short of robust human-like ToM. We argue that SCALPEL can help…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Bayesian Modeling and Causal Inference
