Optimal Approximation and Learning Rates for Deep Convolutional Neural Networks
Shao-Bo Lin

TL;DR
This paper analyzes the approximation and learning capabilities of deep convolutional neural networks, establishing near-optimal rates for approximating smooth functions and for empirical risk minimization.
Contribution
It provides theoretical proofs of approximation and learning rates for deep CNNs, showing they are nearly optimal up to a logarithmic factor.
Findings
Approximation rates for smooth functions are of order (L^2 / log L)^{-2r/d}.
Deep CNNs achieve almost optimal learning rates for empirical risk minimization.
The results are applicable to CNNs with zero-padding and max-pooling.
Abstract
This paper focuses on approximation and learning performance analysis for deep convolutional neural networks with zero-padding and max-pooling. We prove that, to approximate -smooth function, the approximation rates of deep convolutional neural networks with depth are of order , which is optimal up to a logarithmic factor. Furthermore, we deduce almost optimal learning rates for implementing empirical risk minimization over deep convolutional neural networks.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Mathematical Approximation and Integration · Stochastic Gradient Optimization Techniques
