Learning Ability of Interpolating Deep Convolutional Neural Networks
Tian-Yi Zhou, Xiaoming Huo

TL;DR
This paper provides the first theoretical analysis of the learning capabilities of deep convolutional neural networks (DCNNs), demonstrating their ability to generalize well even when overfitted, through new learning rate results and a deepening scheme.
Contribution
It establishes the first learning rates for underparameterized DCNNs without structural restrictions and introduces a novel deepening scheme to create interpolating DCNNs with good learning rates.
Findings
Established learning rates for underparameterized DCNNs.
Showed that adding layers can produce interpolating DCNNs with maintained learning rates.
Provided theoretical insight into why overfitted DCNNs generalize well.
Abstract
It is frequently observed that overparameterized neural networks generalize well. Regarding such phenomena, existing theoretical work mainly devotes to linear settings or fully-connected neural networks. This paper studies the learning ability of an important family of deep neural networks, deep convolutional neural networks (DCNNs), under both underparameterized and overparameterized settings. We establish the first learning rates of underparameterized DCNNs without parameter or function variable structure restrictions presented in the literature. We also show that by adding well-defined layers to a non-interpolating DCNN, we can obtain some interpolating DCNNs that maintain the good learning rates of the non-interpolating DCNN. This result is achieved by a novel network deepening scheme designed for DCNNs. Our work provides theoretical verification of how overfitted DCNNs generalize…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Neural Networks and Applications · Machine Learning and Algorithms
MethodsDiffusion-Convolutional Neural Networks
