Do Deep Neural Networks Suffer from Crowding?
Anna Volokitin, Gemma Roig, Tomaso Poggio

TL;DR
This paper investigates how deep neural networks experience crowding effects similar to humans, revealing that eccentricity-dependent models are more robust to flankers, and identifies factors influencing crowding in neural networks.
Contribution
The study introduces an eccentricity-dependent neural network model inspired by the primate visual cortex and compares its crowding resilience to standard DCNNs, highlighting the impact of training and architecture.
Findings
Eccentricity-dependent models recognize targets near the center despite flankers.
Standard DCNNs' accuracy decreases with closer flankers and more flankers.
Pooling in early layers increases crowding effects.
Abstract
Crowding is a visual effect suffered by humans, in which an object that can be recognized in isolation can no longer be recognized when other objects, called flankers, are placed close to it. In this work, we study the effect of crowding in artificial Deep Neural Networks for object recognition. We analyze both standard deep convolutional neural networks (DCNNs) as well as a new version of DCNNs which is 1) multi-scale and 2) with size of the convolution filters change depending on the eccentricity wrt to the center of fixation. Such networks, that we call eccentricity-dependent, are a computational model of the feedforward path of the primate visual cortex. Our results reveal that the eccentricity-dependent model, trained on target objects in isolation, can recognize such targets in the presence of flankers, if the targets are near the center of the image, whereas DCNNs cannot. Also,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Visual perception and processing mechanisms · Face Recognition and Perception
