So Cloze yet so Far: N400 Amplitude is Better Predicted by   Distributional Information than Human Predictability Judgements

James A. Michaelov; Seana Coulson; Benjamin K. Bergen

arXiv:2109.01226·cs.CL·May 26, 2022

So Cloze yet so Far: N400 Amplitude is Better Predicted by Distributional Information than Human Predictability Judgements

James A. Michaelov, Seana Coulson, Benjamin K. Bergen

PDF

TL;DR

This study compares human and computational language model predictions of N400 amplitude, finding that models like GPT-3, RoBERTa, and ALBERT better predict neural responses than human predictability judgments, highlighting the influence of surface-level language statistics.

Contribution

It demonstrates that computational models' predictions align more closely with neural responses than human judgments, challenging assumptions about human predictive processing in language comprehension.

Findings

01

Language models better predict N400 amplitude than humans.

02

Surface-level statistical information influences neural responses more than human predictability.

03

Computational models' predictions correlate strongly with neural data.

Abstract

More predictable words are easier to process - they are read faster and elicit smaller neural signals associated with processing difficulty, most notably, the N400 component of the event-related brain potential. Thus, it has been argued that prediction of upcoming words is a key component of language comprehension, and that studying the amplitude of the N400 is a valuable way to investigate the predictions we make. In this study, we investigate whether the linguistic predictions of computational language models or humans better reflect the way in which natural language stimuli modulate the amplitude of the N400. One important difference in the linguistic predictions of humans versus computational language models is that while language models base their predictions exclusively on the preceding linguistic context, humans may rely on other factors. We find that the predictions of three…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Linear Layer · Linear Warmup With Linear Decay · Layer Normalization · LAMB · Residual Connection · BERT · Dropout · Softmax · Attention Dropout