Do language models learn typicality judgments from text?
Kanishka Misra, Allyson Ettinger, Julia Taylor Rayz

TL;DR
This study assesses whether language models trained solely on text can learn typicality judgments, finding modest similarities to human cognition but indicating that textual data alone may be insufficient.
Contribution
The paper introduces two novel tests to evaluate typicality effects in language models, bridging cognitive science and NLP.
Findings
LM probabilities are modulated by typicality in category assignments
LMs show some sensitivity to typicality in extending new information
Text-only training yields limited typicality knowledge in LMs
Abstract
Building on research arguing for the possibility of conceptual and categorical knowledge acquisition through statistics contained in language, we evaluate predictive language models (LMs) -- informed solely by textual input -- on a prevalent phenomenon in cognitive science: typicality. Inspired by experiments that involve language processing and show robust typicality effects in humans, we propose two tests for LMs. Our first test targets whether typicality modulates LM probabilities in assigning taxonomic category memberships to items. The second test investigates sensitivities to typicality in LMs' probabilities when extending new information about items to their categories. Both tests show modest -- but not completely absent -- correspondence between LMs and humans, suggesting that text-based exposure alone is insufficient to acquire typicality knowledge.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Language and cultural evolution
