Optimal Approximation Rate of ReLU Networks in terms of Width and Depth

Zuowei Shen; Haizhao Yang; Shijun Zhang

arXiv:2103.00502·cs.LG·December 15, 2021

Optimal Approximation Rate of ReLU Networks in terms of Width and Depth

Zuowei Shen, Haizhao Yang, Shijun Zhang

PDF

TL;DR

This paper establishes the optimal approximation rates of deep ReLU neural networks in terms of width and depth for functions on [0,1]^d, improving existing bounds by including a logarithmic factor and extending results to arbitrary continuous functions.

Contribution

It proves that ReLU networks can achieve nearly optimal approximation rates with explicit constructions, including new bounds involving logarithmic factors and fixed-depth networks for Lipschitz functions.

Findings

01

ReLU networks with specified width and depth approximate Hölder functions at optimal rates.

02

The approximation rate for arbitrary continuous functions depends on the modulus of continuity.

03

Fixed-depth networks can approximate Lipschitz functions with a rate involving W ln W, a novel result.

Abstract

This paper concentrates on the approximation power of deep feed-forward neural networks in terms of width and depth. It is proved by construction that ReLU networks with width $O (max {d ⌊ N^{1/ d} ⌋, N + 2})$ and depth $O (L)$ can approximate a H\"older continuous function on $[0, 1]^{d}$ with an approximation rate $O (λ d (N^{2} L^{2} ln N)^{- α / d})$ , where $α \in (0, 1]$ and $λ > 0$ are H\"older order and constant, respectively. Such a rate is optimal up to a constant in terms of width and depth separately, while existing results are only nearly optimal without the logarithmic factor in the approximation rate. More generally, for an arbitrary continuous function $f$ on $[0, 1]^{d}$ , the approximation rate becomes $O (d ω_{f} ((N^{2} L^{2} ln N)^{- 1/ d}))$ , where $ω_{f} (\cdot)$ is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Methods*Communicated@Fast*How Do I Communicate to Expedia?