CNNs Avoid Curse of Dimensionality by Learning on Patches
Vamshi C. Madala, Shivkumar Chandrasekaran, Jason Bunk

TL;DR
This paper proposes a patch-based theoretical framework explaining CNNs' ability to generalize well in image classification, attributing it to their operation on image patches which mitigates the curse of dimensionality.
Contribution
It introduces the first a priori error bound for CNN generalization error based on patch processing, supported by quantitative and qualitative evidence.
Findings
CNNs operate on image patches, reducing effective dimensionality.
Patch-based theory explains effectiveness of data augmentation techniques.
Provides the first a priori generalization error bound for CNNs.
Abstract
Despite the success of convolutional neural networks (CNNs) in numerous computer vision tasks and their extraordinary generalization performances, several attempts to predict the generalization errors of CNNs have only been limited to a posteriori analyses thus far. A priori theories explaining the generalization performances of deep neural networks have mostly ignored the convolutionality aspect and do not specify why CNNs are able to seemingly overcome curse of dimensionality on computer vision tasks like image classification where the image dimensions are in thousands. Our work attempts to explain the generalization performance of CNNs on image classification under the hypothesis that CNNs operate on the domain of image patches. Ours is the first work we are aware of to derive an a priori error bound for the generalization error of CNNs and we present both quantitative and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Machine Learning and Data Classification · Neural Networks and Applications
MethodsCutout · Attentive Walk-Aggregating Graph Neural Network · CutMix
