Theory of Deep Convolutional Neural Networks III: Approximating Radial Functions
Tong Mao, Zhongjie Shi, and Ding-Xuan Zhou

TL;DR
This paper develops an approximation theory for deep convolutional neural networks with explicit rates, demonstrating their superiority over shallow networks in approximating radial functions, and analyzes their generalization in regression tasks.
Contribution
It provides the first rigorous proof of deep CNNs outperforming shallow networks for radial functions and offers a generalization analysis with an error bound showing a depth trade-off.
Findings
Deep CNNs can approximate radial functions more efficiently than shallow networks.
The approximation error decreases then increases with network depth, indicating an optimal depth.
Theoretical validation of the depth trade-off phenomenon in neural networks.
Abstract
We consider a family of deep neural networks consisting of two groups of convolutional layers, a downsampling operator, and a fully connected layer. The network structure depends on two structural parameters which determine the numbers of convolutional layers and the width of the fully connected layer. We establish an approximation theory with explicit approximation rates when the approximated function takes a composite form with a feature polynomial and a univariate function . In particular, we prove that such a network can outperform fully connected shallow networks in approximating radial functions with , when the dimension of data from is large. This gives the first rigorous proof for the superiority of deep convolutional neural networks in approximating functions with special structures. Then we carry out generalization analysis for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Neural Networks and Applications · Generative Adversarial Networks and Image Synthesis
