Are Deep Neural Networks Adequate Behavioural Models of Human Visual Perception?
Felix A. Wichmann, Robert Geirhos

TL;DR
This paper reviews evidence on whether deep neural networks can serve as accurate behavioral models of human visual perception, concluding they are promising but not yet adequate models for human core object recognition.
Contribution
It clarifies the distinction between statistical tools and computational models and evaluates DNNs' current status as models of human visual perception.
Findings
DNNs are valuable scientific tools for vision research.
Current DNNs are promising but not fully adequate models of human core object recognition.
The paper dispels myths about DNNs in vision science.
Abstract
Deep neural networks (DNNs) are machine learning algorithms that have revolutionised computer vision due to their remarkable successes in tasks like object classification and segmentation. The success of DNNs as computer vision algorithms has led to the suggestion that DNNs may also be good models of human visual perception. We here review evidence regarding current DNNs as adequate behavioural models of human core object recognition. To this end, we argue that it is important to distinguish between statistical tools and computational models, and to understand model quality as a multidimensional concept where clarity about modelling goals is key. Reviewing a large number of psychophysical and computational explorations of core object recognition performance in humans and DNNs, we argue that DNNs are highly valuable scientific tools but that as of today DNNs should only be regarded as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace Recognition and Perception · Visual perception and processing mechanisms · Visual Attention and Saliency Detection
