Why do deep convolutional networks generalize so poorly to small image transformations?
Aharon Azulay, Yair Weiss

TL;DR
This paper investigates why CNNs fail to generalize to small image transformations despite assumptions of invariance, revealing limitations of architecture and data augmentation, and exploring potential partial solutions.
Contribution
The paper demonstrates that CNNs do not inherently possess invariance to small transformations due to architectural and training limitations, and evaluates potential solutions.
Findings
CNNs are sensitive to small image transformations.
Architectural invariance is limited by sampling theorem considerations.
Data augmentation alone does not ensure invariance.
Abstract
Convolutional Neural Networks (CNNs) are commonly assumed to be invariant to small image transformations: either because of the convolutional architecture or because they were trained using data augmentation. Recently, several authors have shown that this is not the case: small translations or rescalings of the input image can drastically change the network's prediction. In this paper, we quantify this phenomena and ask why neither the convolutional architecture nor data augmentation are sufficient to achieve the desired invariance. Specifically, we show that the convolutional architecture does not give invariance since architectures ignore the classical sampling theorem, and data augmentation does not give invariance because the CNNs learn to be invariant to transformations only for images that are very similar to typical images from the training set. We discuss two possible solutions…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Adversarial Robustness in Machine Learning
