An Existence Proof for Neural Language Models That Can Explain Garden-Path Effects via Surprisal

Ryo Yoshida; Shinnosuke Isono; Taiga Someya; Yohei Oseki; Tatsuki Kuribayashi

arXiv:2604.18293·cs.CL·April 21, 2026

TL;DR

This paper demonstrates that neural language models can be fine-tuned to account for garden-path effects in sentence processing, supporting surprisal theory's applicability to complex syntactic disambiguation.

Contribution

The paper provides an existence proof that neural LMs can be adapted to explain garden-path effects via surprisal, challenging the claim that the processing difficulty of such sentences cannot be reduced to surprisal.

Findings

01 Fine-tuned LMs capture human reading slowdowns on garden-path sentences.

02 Fine-tuned LMs improve prediction of human reading times on naturalistic data.

03 Fine-tuned LMs do not overfit and maintain general language modeling capabilities.

Abstract

Surprisal theory hypothesizes that the difficulty of human sentence processing increases linearly with surprisal, the negative log-probability of a word given its context. Computational psycholinguistics has tested this hypothesis using language models (LMs) as proxies for human prediction. While surprisal derived from recent neural LMs generally captures human processing difficulty on naturalistic corpora that predominantly consist of simple sentences, it severely underestimates processing difficulty on sentences that require syntactic disambiguation (garden-path effects). This leads to the claim that the processing difficulty of such sentences cannot be reduced to surprisal, although it remains possible that neural LMs simply differ from humans in next-word prediction. In this paper, we investigate whether it is truly impossible to construct a neural LM that can explain garden-path…
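The definition of surprisal in the abstract, and the linear link to processing difficulty that surprisal theory assumes, can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the log base (2, giving bits), the toy probabilities, the slope and intercept values, and the function names `surprisal` and `predicted_reading_time`.

```python
import math


def surprisal(prob: float) -> float:
    """Negative log-probability of a word given its context.

    Base 2 (bits) is an assumption; the abstract does not fix the base.
    """
    if not 0.0 < prob <= 1.0:
        raise ValueError("probability must be in (0, 1]")
    return -math.log2(prob)


def predicted_reading_time(surprisal_bits: float,
                           slope_ms_per_bit: float,
                           intercept_ms: float) -> float:
    """Linear linking function: difficulty grows linearly with surprisal.

    The slope and intercept are hypothetical free parameters that would
    normally be estimated from human reading-time data.
    """
    return intercept_ms + slope_ms_per_bit * surprisal_bits


# Toy next-word probabilities (hypothetical, not from any model or the paper).
p_expected = 0.5         # a continuation the model finds likely
p_disambiguating = 0.03125  # a continuation the model finds unlikely

s_expected = surprisal(p_expected)
s_disambiguating = surprisal(p_disambiguating)

# With a hypothetical slope of 4 ms per bit and a 300 ms baseline:
rt_expected = predicted_reading_time(s_expected, 4.0, 300.0)
rt_disambiguating = predicted_reading_time(s_disambiguating, 4.0, 300.0)
```

Under this linking function, the predicted slowdown at a disambiguating word depends only on how much less probable the LM finds it than the baseline word. An LM that assigns the word too high a probability will therefore underestimate the human slowdown, which is the mismatch the abstract describes for garden-path sentences.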
