Do Large Language Models know what humans know?

Sean Trott; Cameron Jones; Tyler Chang; James Michaelov; Benjamin; Bergen

arXiv:2209.01515·cs.CL·June 2, 2023·1 cites

Do Large Language Models know what humans know?

Sean Trott, Cameron Jones, Tyler Chang, James Michaelov, Benjamin, Bergen

PDF

Open Access 1 Repo

TL;DR

This study investigates whether large language models like GPT-3 can understand others' mental states, finding they show some sensitivity but do not fully replicate human belief attribution, implying additional mechanisms are involved.

Contribution

The paper provides empirical evidence that language exposure alone partially explains theory of mind development, highlighting the need for other mechanisms in humans.

Findings

01

GPT-3 exceeds chance in false belief tasks

02

Humans outperform GPT-3 in belief attribution

03

Language exposure alone does not fully account for human theory of mind

Abstract

Humans can attribute beliefs to others. However, it is unknown to what extent this ability results from an innate biological endowment or from experience accrued through child development, particularly exposure to language describing others' mental states. We test the viability of the language exposure hypothesis by assessing whether models exposed to large quantities of human language display sensitivity to the implied knowledge states of characters in written passages. In pre-registered analyses, we present a linguistic version of the False Belief Task to both human participants and a Large Language Model, GPT-3. Both are sensitive to others' beliefs, but while the language model significantly exceeds chance behavior, it does not perform as well as the humans, nor does it explain the full extent of their behavior -- despite being exposed to more language than a human would in a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ucsd-language-and-cognition-lab/nlm-fb
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling

MethodsMulti-Head Attention · Attention Is All You Need · Test · Linear Layer · Cosine Annealing · Weight Decay · Dropout · 15 Ways to Contact How can i speak to someone at Delta Airlines · Refunds@Expedia|||How do I get a full refund from Expedia? · Byte Pair Encoding