Measuring the tendency of CNNs to Learn Surface Statistical Regularities
Jason Jo, Yoshua Bengio

TL;DR
This paper investigates whether deep CNNs primarily learn surface statistical regularities rather than high-level abstractions, revealing a tendency to rely on dataset-specific Fourier image statistics which affects their generalization capabilities.
Contribution
The study introduces Fourier filtering to create datasets with preserved high-level features but altered surface statistics, providing a method to measure CNNs' reliance on surface regularities.
Findings
CNNs tend to latch onto Fourier image statistics of training data.
Significant generalization gaps up to 28% are observed across test sets.
Increasing network depth has minimal impact on reducing the generalization gap.
Abstract
Deep CNNs are known to exhibit the following peculiarity: on the one hand they generalize extremely well to a test set, while on the other hand they are extremely sensitive to so-called adversarial perturbations. The extreme sensitivity of high performance CNNs to adversarial examples casts serious doubt that these networks are learning high level abstractions in the dataset. We are concerned with the following question: How can a deep CNN that does not learn any high level semantics of the dataset manage to generalize so well? The goal of this article is to measure the tendency of CNNs to learn surface statistical regularities of the dataset. To this end, we use Fourier filtering to construct datasets which share the exact same high level abstractions but exhibit qualitatively different surface statistical regularities. For the SVHN and CIFAR-10 datasets, we present two Fourier…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Domain Adaptation and Few-Shot Learning
