Tracking translation invariance in CNNs
Johannes C. Myburgh, Coenraad Mouton, Marelie H. Davel

TL;DR
This paper investigates how architectural choices in CNNs, such as kernel size and feature map dimensions, affect their translation invariance, revealing that convolutional layers contribute less to invariance than previously thought.
Contribution
It introduces translation-sensitivity maps with Cosine Similarity to quantify translation invariance and analyzes the impact of kernel size and feature map dimensions on CNN translation robustness.
Findings
Kernel size and feature map size systematically influence translation invariance.
Convolutional layers contribute less to invariance than expected when not explicitly designed for it.
Translation invariance varies across different layers within CNN architectures.
Abstract
Although Convolutional Neural Networks (CNNs) are widely used, their translation invariance (ability to deal with translated inputs) is still subject to some controversy. We explore this question using translation-sensitivity maps to quantify how sensitive a standard CNN is to a translated input. We propose the use of Cosine Similarity as sensitivity metric over Euclidean Distance, and discuss the importance of restricting the dimensionality of either of these metrics when comparing architectures. Our main focus is to investigate the effect of different architectural components of a standard CNN on that network's sensitivity to translation. By varying convolutional kernel sizes and amounts of zero padding, we control the size of the feature maps produced, allowing us to quantify the extent to which these elements influence translation invariance. We also measure translation invariance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
