How Well Do Deep Learning Models Capture Human Concepts? The Case of the Typicality Effect
Siddhartha K. Vemuri, Raj Sanjay Shah, Sashank Varma

TL;DR
This study evaluates how well deep learning models, including language, vision, and multimodal models, replicate human typicality judgments across a broad set of concepts, revealing that multimodal models align best.
Contribution
It expands behavioral evaluation of models by testing multiple architectures and concepts, and introduces a new image set for assessing vision model conceptual alignment.
Findings
Language models better match human typicality judgments than vision models.
Combined language and vision models outperform individual models.
Multimodal models like CLIP ViT show strong potential in explaining human typicality.
Abstract
How well do representations learned by ML models align with those of humans? Here, we consider concept representations learned by deep learning models and evaluate whether they show a fundamental behavioral signature of human concepts, the typicality effect. This is the finding that people judge some instances (e.g., robin) of a category (e.g., Bird) to be more typical than others (e.g., penguin). Recent research looking for human-like typicality effects in language and vision models has focused on models of a single modality, tested only a small number of concepts, and found only modest correlations with human typicality ratings. The current study expands this behavioral evaluation of models by considering a broader range of language (N = 8) and vision (N = 10) model architectures. It also evaluates whether the combined typicality predictions of vision + language model pairs, as well…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI)
MethodsSparse Evolutionary Training · ALIGN · Contrastive Language-Image Pre-training
