Less is more: Probabilistic reduction is best explained by small-scale predictability measures
Cassandra L. Jacobs, Andr\'es Bux\'o-Lugo, Anna K. Taylor, Marie Leopold-Hooke

TL;DR
This paper explores how small-scale predictability measures, like n-grams, can effectively explain probabilistic reduction in language models, emphasizing the importance of minimal context for cognitive phenomena analysis.
Contribution
It demonstrates that n-gram representations are sufficient for understanding probabilistic reduction, challenging the need for larger context in modeling cognitive language processes.
Findings
N-gram models suffice for probabilistic reduction analysis
Small-scale predictability measures are effective in cognitive language modeling
Whole utterances are not necessary for observing probabilistic effects
Abstract
The primary research questions of this paper center on defining the amount of context that is necessary and/or appropriate when investigating the relationship between language model probabilities and cognitive phenomena. We investigate whether whole utterances are necessary to observe probabilistic reduction and demonstrate that n-gram representations suffice as cognitive units of planning.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems · Constraint Satisfaction and Optimization · Syntax, Semantics, Linguistic Variation
