How Well Do Deep Learning Models Capture Human Concepts? The Case of the   Typicality Effect

Siddhartha K. Vemuri; Raj Sanjay Shah; Sashank Varma

arXiv:2405.16128·cs.AI·May 28, 2024

How Well Do Deep Learning Models Capture Human Concepts? The Case of the Typicality Effect

Siddhartha K. Vemuri, Raj Sanjay Shah, Sashank Varma

PDF

Open Access

TL;DR

This study evaluates how well deep learning models, including language, vision, and multimodal models, replicate human typicality judgments across a broad set of concepts, revealing that multimodal models align best.

Contribution

It expands behavioral evaluation of models by testing multiple architectures and concepts, and introduces a new image set for assessing vision model conceptual alignment.

Findings

01

Language models better match human typicality judgments than vision models.

02

Combined language and vision models outperform individual models.

03

Multimodal models like CLIP ViT show strong potential in explaining human typicality.

Abstract

How well do representations learned by ML models align with those of humans? Here, we consider concept representations learned by deep learning models and evaluate whether they show a fundamental behavioral signature of human concepts, the typicality effect. This is the finding that people judge some instances (e.g., robin) of a category (e.g., Bird) to be more typical than others (e.g., penguin). Recent research looking for human-like typicality effects in language and vision models has focused on models of a single modality, tested only a small number of concepts, and found only modest correlations with human typicality ratings. The current study expands this behavioral evaluation of models by considering a broader range of language (N = 8) and vision (N = 10) model architectures. It also evaluates whether the combined typicality predictions of vision + language model pairs, as well…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI)

MethodsSparse Evolutionary Training · ALIGN · Contrastive Language-Image Pre-training