Optimal Approximation Rate of ReLU Networks in terms of Width and Depth
Zuowei Shen, Haizhao Yang, Shijun Zhang

TL;DR
This paper establishes the optimal approximation rates of deep ReLU neural networks in terms of width and depth for functions on [0,1]^d, improving existing bounds by including a logarithmic factor and extending results to arbitrary continuous functions.
Contribution
It proves that ReLU networks can achieve nearly optimal approximation rates with explicit constructions, including new bounds involving logarithmic factors and fixed-depth networks for Lipschitz functions.
Findings
ReLU networks with specified width and depth approximate Hölder functions at optimal rates.
The approximation rate for arbitrary continuous functions depends on the modulus of continuity.
Fixed-depth networks can approximate Lipschitz functions with a rate involving W ln W, a novel result.
Abstract
This paper concentrates on the approximation power of deep feed-forward neural networks in terms of width and depth. It is proved by construction that ReLU networks with width and depth can approximate a H\"older continuous function on with an approximation rate , where and are H\"older order and constant, respectively. Such a rate is optimal up to a constant in terms of width and depth separately, while existing results are only nearly optimal without the logarithmic factor in the approximation rate. More generally, for an arbitrary continuous function on , the approximation rate becomes , where is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Methods*Communicated@Fast*How Do I Communicate to Expedia?
