Aligning Machine and Human Visual Representations across Abstraction Levels

Lukas Muttenthaler; Klaus Greff; Frieda Born; Bernhard Spitzer; Simon Kornblith; Michael C. Mozer; Klaus-Robert M\"uller; Thomas Unterthiner; Andrew K. Lampinen

arXiv:2409.06509·cs.CV·September 4, 2025·2 cites

Aligning Machine and Human Visual Representations across Abstraction Levels

Lukas Muttenthaler, Klaus Greff, Frieda Born, Bernhard Spitzer, Simon Kornblith, Michael C. Mozer, Klaus-Robert M\"uller, Thomas Unterthiner, Andrew K. Lampinen

PDF

Open Access

TL;DR

This paper proposes a method to align neural network representations with human conceptual hierarchies by finetuning models with human judgment data, improving their alignment with human behavior and robustness.

Contribution

The authors introduce a novel finetuning approach using a teacher model trained on human judgments to enhance model alignment with human conceptual structures.

Findings

01

Human-aligned models better match human similarity judgments

02

Aligned models show improved out-of-distribution robustness

03

Enhanced models perform better on diverse machine learning tasks

Abstract

Deep neural networks have achieved success across a wide range of applications, including as models of human behavior and neural representations in vision tasks. However, neural network training and human learning differ in fundamental ways, and neural networks often fail to generalize as robustly as humans do raising questions regarding the similarity of their underlying representations. What is missing for modern learning systems to exhibit more human-aligned behavior? We highlight a key misalignment between vision models and humans: whereas human conceptual knowledge is hierarchically organized from fine- to coarse-scale distinctions, model representations do not accurately capture all these levels of abstraction. To address this misalignment, we first train a teacher model to imitate human judgments, then transfer human-aligned structure from its representations to refine the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Visualization and Analytics · Human Motion and Animation · Social Robot Interaction and HRI

MethodsSparse Evolutionary Training