Modeling structure-building in the brain with CCG parsing and large language models
Milo\v{s} Stanojevi\'c, Jonathan R. Brennan, Donald Dunagan and, Mark Steedman, John T. Hale

TL;DR
This study investigates whether Combinatory Categorial Grammars (CCGs), with their greater expressive power, better model neural signals during language comprehension than context-free grammars (CFGs), using fMRI data from audiobook listening.
Contribution
It demonstrates that CCG-based measures provide a superior fit to neural signals in the left posterior temporal lobe compared to CFGs, highlighting the importance of expressive grammatical models in neural language processing.
Findings
CCG structure-building correlates with neural signals in the left posterior temporal lobe.
CCG measures outperform CFG in modeling neural responses during naturalistic listening.
Neural effects of structure-building are distinct from predictability effects in the brain.
Abstract
To model behavioral and neural correlates of language comprehension in naturalistic environments researchers have turned to broad-coverage tools from natural-language processing and machine learning. Where syntactic structure is explicitly modeled, prior work has relied predominantly on context-free grammars (CFG), yet such formalisms are not sufficiently expressive for human languages. Combinatory Categorial Grammars (CCGs) are sufficiently expressive directly compositional models of grammar with flexible constituency that affords incremental interpretation. In this work we evaluate whether a more expressive CCG provides a better model than a CFG for human neural signals collected with fMRI while participants listen to an audiobook story. We further test between variants of CCG that differ in how they handle optional adjuncts. These evaluations are carried out against a baseline that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeurobiology of Language and Bilingualism · Natural Language Processing Techniques · Language and cultural evolution
MethodsMulti-Head Attention · Attention Is All You Need · Test · Linear Layer · Softmax · Adam · Position-Wise Feed-Forward Layer · Dense Connections · Label Smoothing · Absolute Position Encodings
