Approximation Properties of Deep ReLU CNNs

Juncai He; Lin Li; Jinchao Xu

arXiv:2109.00190·cs.LG·July 4, 2022

Approximation Properties of Deep ReLU CNNs

Juncai He, Lin Li, Jinchao Xu

PDF

TL;DR

This paper establishes the approximation capabilities of deep ReLU CNNs in 2D, linking their structure to universal approximation theorems and analyzing variants like ResNet and MgNet.

Contribution

It provides the first comprehensive analysis of approximation properties for deep ReLU CNNs, including their connection to one-hidden-layer ReLU networks and variants like ResNet and MgNet.

Findings

01

Deep ReLU CNNs can universally approximate functions in L^2 space.

02

The analysis connects CNN approximation properties to those of shallow ReLU networks.

03

Approximation properties are extended to ResNet, pre-act ResNet, and MgNet architectures.

Abstract

This paper focuses on establishing $L^{2}$ approximation properties for deep ReLU convolutional neural networks (CNNs) in two-dimensional space. The analysis is based on a decomposition theorem for convolutional kernels with a large spatial size and multi-channels. Given the decomposition result, the property of the ReLU activation function, and a specific structure for channels, a universal approximation theorem of deep ReLU CNNs with classic structure is obtained by showing its connection with one-hidden-layer ReLU neural networks (NNs). Furthermore, approximation properties are obtained for one version of neural networks with ResNet, pre-act ResNet, and MgNet architecture based on connections between these networks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Methods*Communicated@Fast*How Do I Communicate to Expedia? · Average Pooling · Residual Connection · Residual Block · 1x1 Convolution · Max Pooling · Kaiming Initialization · Convolution · Batch Normalization · Global Average Pooling