Less is more: Probabilistic reduction is best explained by small-scale predictability measures

Cassandra L. Jacobs; Andr\'es Bux\'o-Lugo; Anna K. Taylor; Marie Leopold-Hooke

arXiv:2512.23659·cs.CL·January 7, 2026

Less is more: Probabilistic reduction is best explained by small-scale predictability measures

Cassandra L. Jacobs, Andr\'es Bux\'o-Lugo, Anna K. Taylor, Marie Leopold-Hooke

PDF

Open Access

TL;DR

This paper explores how small-scale predictability measures, like n-grams, can effectively explain probabilistic reduction in language models, emphasizing the importance of minimal context for cognitive phenomena analysis.

Contribution

It demonstrates that n-gram representations are sufficient for understanding probabilistic reduction, challenging the need for larger context in modeling cognitive language processes.

Findings

01

N-gram models suffice for probabilistic reduction analysis

02

Small-scale predictability measures are effective in cognitive language modeling

03

Whole utterances are not necessary for observing probabilistic effects

Abstract

The primary research questions of this paper center on defining the amount of context that is necessary and/or appropriate when investigating the relationship between language model probabilities and cognitive phenomena. We investigate whether whole utterances are necessary to observe probabilistic reduction and demonstrate that n-gram representations suffice as cognitive units of planning.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and dialogue systems · Constraint Satisfaction and Optimization · Syntax, Semantics, Linguistic Variation