Dual Alignment Between Language Model Layers and Human Sentence Processing
Tatsuki Kuribayashi, Alex Warstadt, Yohei Oseki, Ethan Gotlieb Wilcox

TL;DR
This study investigates how different layers of large language models align with human sentence processing, especially in syntactically challenging contexts, revealing that earlier layers mimic natural reading, while later layers better model complex syntactic processing.
Contribution
It demonstrates that different layers of LLMs correspond to distinct modes of human sentence processing, especially under syntactic challenge, and introduces probability-update measures for improved modeling.
Findings
Earlier layers align with naturalistic reading behavior.
Later layers better estimate cognitive effort in syntactic ambiguity.
Probability-update measures offer complementary advantages in reading time prediction.
Abstract
A recent study (Kuribayashi et al., 2025) has shown that human sentence processing behavior, typically measured on syntactically unchallenging constructions, can be effectively modeled using surprisal from early layers of large language models (LLMs). This raises the question of whether such advantages of internal layers extend to more syntactically challenging constructions, where surprisal has been reported to underestimate human cognitive effort. In this paper, we begin by exploring internal layers that better estimate human cognitive effort observed in syntactic ambiguity processing in English. Our experiments show that, in contrast to naturalistic reading, later layers better estimate such a cognitive effort, but still underestimate the human data. This dual alignment sheds light on different modes of sentence processing in humans and LMs: naturalistic reading employs a somewhat…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
