Tracking Naturalistic Linguistic Predictions with Deep Neural Language Models
Micha Heilbron, Benedikt Ehinger, Peter Hagoort, Floris P. de Lange

TL;DR
This study uses advanced neural language models to analyze linguistic predictability in naturalistic speech, showing that long-context models better align with neural responses and challenge previous simplistic assumptions about prediction in language comprehension.
Contribution
It extends prior research by employing a neural language model that considers extensive context, demonstrating its effectiveness in predicting neural responses during natural language processing.
Findings
Neural predictability estimates better fit EEG data than simpler models
Strong surprise responses observed similar to P200 and N400
Long-context models reveal more accurate linguistic prediction effects
Abstract
Prediction in language has traditionally been studied using simple designs in which neural responses to expected and unexpected words are compared in a categorical fashion. However, these designs have been contested as being `prediction encouraging', potentially exaggerating the importance of prediction in language understanding. A few recent studies have begun to address these worries by using model-based approaches to probe the effects of linguistic predictability in naturalistic stimuli (e.g. continuous narrative). However, these studies so far only looked at very local forms of prediction, using models that take no more than the prior two words into account when computing a word's predictability. Here, we extend this approach using a state-of-the-art neural language model that can take roughly 500 times longer linguistic contexts into account. Predictability estimates from the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
