Do Vision Models Encode Object-Level Semantic Relatedness? A Cognitive Psychology-Inspired Benchmark

Hansang Lee; Haeil Lee; Junmo Kim

arXiv:1709.03806·cs.CV·May 22, 2026·6 cites

Do Vision Models Encode Object-Level Semantic Relatedness? A Cognitive Psychology-Inspired Benchmark

Hansang Lee, Haeil Lee, Junmo Kim

PDF

TL;DR

This study evaluates whether modern vision models encode object-level semantic relatedness by using cognitive psychology-inspired benchmarks, revealing differences across model types and paradigms in recognizing semantic relationships.

Contribution

It introduces two new benchmarks, POPORO and PoporoIN, to assess semantic relatedness encoding in vision models, bridging cognitive psychology and AI evaluation.

Findings

01

Transformer models outperform convolutional models in semantic relatedness tasks.

02

Vision-language models surpass vision-only models in recognizing taxonomic relationships.

03

Models more reliably recognize taxonomic than thematic relationships and are more distracted by shape matches.

Abstract

Modern vision models have achieved strong object-recognition performance, yet it remains unclear whether their representations encode object-level semantic relatedness, the meaningful connection between object concepts that supports human visual cognition. Existing benchmarks predominantly target category prediction or rely on image--text matching, leaving the visual representation itself underexamined. Drawing on cognitive psychology, we recast semantic relatedness as a triplet-ranking task and study two image-only test beds: POPORO, an existing 400-triplet psychological stimulus set repurposed for representation evaluation, and PoporoIN, a newly constructed and manually curated 1,000-triplet ImageNet-validation extension. Each triplet is annotated along two orthogonal axes: a related-target axis distinguishing Categorical Relatedness (CR, taxonomic) from conTextual Relatedness (TR,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning · Advanced Neural Network Applications