A unified information-theoretic model of EEG signatures of human language processing
Jiaxuan Li, Richard Futrell

TL;DR
This paper presents an information-theoretic model linking EEG signatures to two levels of language processing in the brain, validated by simulations matching experimental ERP data.
Contribution
It introduces a formal decomposition of linguistic surprisal into heuristic and discrepancy signals, correlating with N400 and P600 EEG components, using modern NLP techniques.
Findings
Successfully simulated ERP patterns from linguistic manipulations
Linked N400 to heuristic surprise and P600 to discrepancy signals
Validated model with experimental EEG data
Abstract
We advance an information-theoretic model of human language processing in the brain, in which incoming linguistic input is processed at two levels, in terms of a heuristic interpretation and in terms of error correction. We propose that these two kinds of information processing have distinct electroencephalographic signatures, corresponding to the well-documented N400 and P600 components of language-related event-related potentials (ERPs). Formally, we show that the information content (surprisal) of a word in context can be decomposed into two quantities: (A) heuristic surprise, which signals processing difficulty of word given its inferred context, and corresponds with the N400 signal; and (B) discrepancy signal, which reflects divergence between the true context and the inferred context, and corresponds to the P600 signal. Both of these quantities can be estimated using modern NLP…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEEG and Brain-Computer Interfaces · Neurobiology of Language and Bilingualism
