Do Large Language Models know what humans know?
Sean Trott, Cameron Jones, Tyler Chang, James Michaelov, Benjamin Bergen

TL;DR
This study tests whether a large language model, GPT-3, is sensitive to others' beliefs by giving it and human participants a linguistic version of the False Belief Task. GPT-3 exceeds chance but falls short of human performance, suggesting that language exposure alone does not account for human belief attribution.
Contribution
The paper provides empirical evidence that language exposure can explain part, but not all, of human belief attribution, which points to additional mechanisms in humans.
Findings
GPT-3 exceeds chance in false belief tasks
Humans outperform GPT-3 in belief attribution
Language exposure alone does not fully account for human theory of mind
Abstract
Humans can attribute beliefs to others. However, it is unknown to what extent this ability results from an innate biological endowment or from experience accrued through child development, particularly exposure to language describing others' mental states. We test the viability of the language exposure hypothesis by assessing whether models exposed to large quantities of human language display sensitivity to the implied knowledge states of characters in written passages. In pre-registered analyses, we present a linguistic version of the False Belief Task to both human participants and a Large Language Model, GPT-3. Both are sensitive to others' beliefs, but while the language model significantly exceeds chance behavior, it does not perform as well as the humans, nor does it explain the full extent of their behavior -- despite being exposed to more language than a human would in a…
Taxonomy
Topics: Topic Modeling
Methods: Multi-Head Attention · Attention Is All You Need · Test · Linear Layer · Cosine Annealing · Weight Decay · Dropout · Byte Pair Encoding
