Dissociable neural representations of adversarially perturbed images in convolutional neural networks and the human brain
Chi Zhang, Xiaohan Duan, Linyuan Wang, Yongli Li, Bin Yan, Guoen Hu,, Ruyuan Zhang, Li Tong

TL;DR
This study compares how the human brain and CNNs process adversarial images, revealing fundamental differences in neural representations and perceptual recognition, which can guide future neural network development.
Contribution
It provides a detailed comparison of neural representations of adversarial images in humans and CNNs, highlighting key differences and suggesting directions for improving CNN models.
Findings
Humans recognize AI images as categories but see AN images as noise.
CNNs recognize AN images but misclassify AI images with high confidence.
Human brain representations align with perceptual similarity, unlike CNNs.
Abstract
Despite the remarkable similarities between convolutional neural networks (CNN) and the human brain, CNNs still fall behind humans in many visual tasks, indicating that there still exist considerable differences between the two systems. Here, we leverage adversarial noise (AN) and adversarial interference (AI) images to quantify the consistency between neural representations and perceptual outcomes in the two systems. Humans can successfully recognize AI images as corresponding categories but perceive AN images as meaningless noise. In contrast, CNNs can correctly recognize AN images but mistakenly classify AI images into wrong categories with surprisingly high confidence. We use functional magnetic resonance imaging to measure brain activity evoked by regular and adversarial images in the human brain, and compare it to the activity of artificial neurons in a prototypical CNN-AlexNet.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
