Testing the Predictions of Surprisal Theory in 11 Languages
Ethan Gotlieb Wilcox, Tiago Pimentel, Clara Meister, Ryan Cotterell,, Roger P. Levy

TL;DR
This study provides crosslinguistic evidence supporting Surprisal Theory by analyzing reading times and predictability in eleven diverse languages, confirming key theoretical predictions across language families.
Contribution
It is the first comprehensive multilingual analysis validating Surprisal Theory's predictions on reading times across diverse languages.
Findings
Surprisal predicts reading times in all examined languages.
Expected surprisal (entropy) also predicts reading times crosslinguistically.
The link between surprisal and reading times is linear across languages.
Abstract
A fundamental result in psycholinguistics is that less predictable words take a longer time to process. One theoretical explanation for this finding is Surprisal Theory (Hale, 2001; Levy, 2008), which quantifies a word's predictability as its surprisal, i.e. its negative log-probability given a context. While evidence supporting the predictions of Surprisal Theory have been replicated widely, most have focused on a very narrow slice of data: native English speakers reading English texts. Indeed, no comprehensive multilingual analysis exists. We address this gap in the current literature by investigating the relationship between surprisal and reading times in eleven different languages, distributed across five language families. Deriving estimates from language models trained on monolingual and multilingual corpora, we test three predictions associated with surprisal theory: (i) whether…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
