Unveiling the Mystery of Visual Attributes of Concrete and Abstract Concepts: Variability, Nearest Neighbors, and Challenging Categories
Tarun Tater, Sabine Schulte im Walde, Diego Frassinelli

TL;DR
This study investigates how visual representations of concepts vary, focusing on concreteness, and compares simple visual features with advanced models for classifying and analyzing these representations across datasets.
Contribution
It provides a comparative analysis of visual features for distinguishing abstract and concrete concepts and highlights the importance of feature selection in multimodal understanding.
Findings
Color and texture features outperform ViT in classification tasks.
ViT models excel in nearest neighbor analysis of visual representations.
Challenging factors for variability include image complexity and context.
Abstract
The visual representation of a concept varies significantly depending on its meaning and the context where it occurs; this poses multiple challenges both for vision and multimodal models. Our study focuses on concreteness, a well-researched lexical-semantic variable, using it as a case study to examine the variability in visual representations. We rely on images associated with approximately 1,000 abstract and concrete concepts extracted from two different datasets: Bing and YFCC. Our goals are: (i) evaluate whether visual diversity in the depiction of concepts can reliably distinguish between concrete and abstract concepts; (ii) analyze the variability of visual features across multiple images of the same concept through a nearest neighbor analysis; and (iii) identify challenging factors contributing to this variability by categorizing and annotating images. Our findings indicate that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDesign Education and Practice · Safety Warnings and Signage
MethodsDense Connections · Residual Connection · Dropout · Layer Normalization · Adam · Byte Pair Encoding · Absolute Position Encodings · Vision Transformer · Softmax · Attention Is All You Need
