Neural networks: deep, shallow, or in between?

Guergana Petrova; Przemyslaw Wojtaszczyk

arXiv:2310.07190·stat.ML·October 12, 2023

Neural networks: deep, shallow, or in between?

Guergana Petrova, Przemyslaw Wojtaszczyk

PDF

Open Access

TL;DR

This paper provides theoretical estimates on the approximation error of neural networks, showing that only infinitely deep networks can surpass entropy number rates, with no advantage gained by increasing width at fixed depth.

Contribution

It establishes lower bounds for neural network approximation errors and clarifies the roles of depth and width in approximation capabilities.

Findings

01

Infinite depth can improve approximation rates beyond entropy numbers.

02

Increasing width alone at fixed depth does not improve approximation rates.

03

Theoretical bounds depend on network architecture and Lipschitz activation functions.

Abstract

We give estimates from below for the error of approximation of a compact subset from a Banach space by the outputs of feed-forward neural networks with width W, depth l and Lipschitz activation functions. We show that, modulo logarithmic factors, rates better that entropy numbers' rates are possibly attainable only for neural networks for which the depth l goes to infinity, and that there is no gain if we fix the depth and let the width W go to infinity.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Model Reduction and Neural Networks · Advanced Numerical Analysis Techniques