Connecting metrics for shape-texture knowledge in computer vision
Tiago Oliveira, Tiago Marques, Arlindo L. Oliveira

TL;DR
This paper investigates how different neural network architectures rely on shape versus texture features in image classification, revealing correlations and competition between these features and their impact on model performance.
Contribution
It extends analysis of shape and texture bias across many architectures, showing their correlation with performance and the anti-correlation between shape and texture representations.
Findings
Model performance correlates with shape bias at output and penultimate layer.
Shape and texture representations are strongly anti-correlated.
Architecture families show significant variation in shape bias and performance.
Abstract
Modern artificial neural networks, including convolutional neural networks and vision transformers, have mastered several computer vision tasks, including object recognition. However, there are many significant differences between the behavior and robustness of these systems and of the human visual system. Deep neural networks remain brittle and susceptible to many changes in the image that do not cause humans to misclassify images. Part of this different behavior may be explained by the type of features humans and deep neural networks use in vision tasks. Humans tend to classify objects according to their shape while deep neural networks seem to rely mostly on texture. Exploring this question is relevant, since it may lead to better performing neural network architectures and to a better understanding of the workings of the vision system of primates. In this work, we advance the state…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques · Medical Image Segmentation Techniques
