Optimal Approximation and Learning Rates for Deep Convolutional Neural   Networks

Shao-Bo Lin

arXiv:2308.03259·cs.LG·August 8, 2023·1 cites

Optimal Approximation and Learning Rates for Deep Convolutional Neural Networks

Shao-Bo Lin

PDF

Open Access

TL;DR

This paper analyzes the approximation and learning capabilities of deep convolutional neural networks, establishing near-optimal rates for approximating smooth functions and for empirical risk minimization.

Contribution

It provides theoretical proofs of approximation and learning rates for deep CNNs, showing they are nearly optimal up to a logarithmic factor.

Findings

01

Approximation rates for smooth functions are of order (L^2 / log L)^{-2r/d}.

02

Deep CNNs achieve almost optimal learning rates for empirical risk minimization.

03

The results are applicable to CNNs with zero-padding and max-pooling.

Abstract

This paper focuses on approximation and learning performance analysis for deep convolutional neural networks with zero-padding and max-pooling. We prove that, to approximate $r$ -smooth function, the approximation rates of deep convolutional neural networks with depth $L$ are of order $(L^{2} / lo g L)^{- 2 r / d}$ , which is optimal up to a logarithmic factor. Furthermore, we deduce almost optimal learning rates for implementing empirical risk minimization over deep convolutional neural networks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Mathematical Approximation and Integration · Stochastic Gradient Optimization Techniques