Equivalence of approximation by convolutional neural networks and fully-connected networks

Philipp Petersen; Felix Voigtlaender

arXiv:1809.00973·math.FA·January 29, 2021

TL;DR

This paper connects convolutional and fully-connected neural networks: upper and lower bounds on approximation rates of fully-connected networks for a function class $C$ carry over, essentially unchanged, to convolutional networks approximating translation-equivariant functions whose first coordinate belongs to $C$.

Contribution

It provides a mathematical link between CNNs and fully-connected networks, enabling transfer of approximation bounds and analysis techniques between these architectures.

Findings

01

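Upper and lower bounds on approximation rates of fully-connected networks for a function class $C$ translate to essentially the same bounds for CNNs on the corresponding class of translation-equivariant functions.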

02

The results are specific to CNNs without pooling and with circular convolutions.

03

The connection allows for unified theoretical analysis of both network types.
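The restriction to circular convolutions without pooling matters because such layers commute with cyclic shifts of the input. The following is a minimal sketch of that property, assuming a one-dimensional signal, a single channel, and a ReLU activation; the names `circular_conv`, `conv_layer`, and `shift` are illustrative and not taken from the paper.

```python
import math
import random


def shift(x, k):
    """Cyclic shift: shift(x, k)[i] == x[(i - k) % n]."""
    n = len(x)
    return [x[(i - k) % n] for i in range(n)]


def circular_conv(x, kernel):
    """Circular (periodic) convolution; output has the same length as x."""
    n = len(x)
    return [sum(w * x[(i - j) % n] for j, w in enumerate(kernel)) for i in range(n)]


def conv_layer(x, kernel, bias):
    """One convolutional layer without pooling: circular conv, bias, ReLU."""
    return [max(v + bias, 0.0) for v in circular_conv(x, kernel)]


random.seed(0)
x = [random.gauss(0.0, 1.0) for _ in range(8)]
kernel = [random.gauss(0.0, 1.0) for _ in range(3)]
bias = 0.1

# Shifting the input and then applying the layer should give the same
# result as applying the layer and then shifting the output.
equivariant = all(
    all(
        math.isclose(a, b)
        for a, b in zip(
            conv_layer(shift(x, k), kernel, bias),
            shift(conv_layer(x, kernel, bias), k),
        )
    )
    for k in range(len(x))
)
print("layer commutes with cyclic shifts:", equivariant)
```

With zero padding instead of periodic indexing, or with a pooling step that changes the output length, this check would generally fail at the boundary, which is one way to see why the setting is restricted.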

Abstract

Convolutional neural networks are the most widely used type of neural networks in applications. In mathematical analysis, however, mostly fully-connected networks are studied. In this paper, we establish a connection between both network architectures. Using this connection, we show that all upper and lower bounds concerning approximation rates of fully-connected neural networks for functions $f \in C$ -- for an arbitrary function class $C$ -- translate to essentially the same bounds concerning approximation rates of convolutional neural networks for functions $f \in C^{\mathrm{equi}}$, with the class $C^{\mathrm{equi}}$ consisting of all translation equivariant functions whose first coordinate belongs to $C$. All presented results consider exclusively the case of convolutional neural networks without any pooling operation and with circular…
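To make the description of $C^{\mathrm{equi}}$ concrete: a translation-equivariant map is fully determined by its first coordinate, since every other coordinate is the first coordinate evaluated on a shifted input. The sketch below illustrates this under simplifying assumptions (one-dimensional inputs, cyclic shifts, zero-based indexing); `equivariant_extension` and the sample function `f` are hypothetical names chosen for illustration, not the paper's construction.

```python
import math


def shift(x, k):
    """Cyclic shift: shift(x, k)[i] == x[(i - k) % n]."""
    n = len(x)
    return [x[(i - k) % n] for i in range(n)]


def equivariant_extension(f):
    """Return F with F(x)[0] == f(x) and F(shift(x, k)) == shift(F(x), k)."""
    def F(x):
        # Coordinate i is f applied to the input shifted back by i positions.
        return [f(shift(x, -i)) for i in range(len(x))]
    return F


def f(x):
    # Hypothetical scalar-valued function playing the role of an element of C.
    return math.sin(x[0]) + 2.0 * x[1] * x[2]


F = equivariant_extension(f)
x = [0.5, -1.0, 2.0, 0.25, 3.0]

first_coordinate_matches = F(x)[0] == f(x)
is_equivariant = all(
    all(math.isclose(a, b) for a, b in zip(F(shift(x, k)), shift(F(x), k)))
    for k in range(len(x))
)
print(first_coordinate_matches, is_equivariant)
```

Read this way, approximating `F` by an equivariant network amounts to approximating the single scalar function `f`, which is the intuition behind relating the two classes.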
