On the approximation of functions by tanh neural networks

Tim De Ryck; Samuel Lanthaler; Siddhartha Mishra

arXiv:2104.08938·math.NA·December 9, 2021

On the approximation of functions by tanh neural networks

Tim De Ryck, Samuel Lanthaler, Siddhartha Mishra

PDF

TL;DR

This paper establishes explicit error bounds for approximating Sobolev and analytic functions using shallow tanh neural networks, demonstrating their efficiency compared to deeper ReLU networks.

Contribution

It provides the first explicit Sobolev norm error bounds for shallow tanh neural networks and compares their approximation rates favorably to deeper ReLU networks.

Findings

01

Tanh neural networks with two hidden layers can approximate functions as well as or better than deeper ReLU networks.

02

Explicit error bounds are derived in high-order Sobolev norms.

03

Shallow tanh networks are effective for high-regularity function approximation.

Abstract

We derive bounds on the error, in high-order Sobolev norms, incurred in the approximation of Sobolev-regular as well as analytic functions by neural networks with the hyperbolic tangent activation function. These bounds provide explicit estimates on the approximation error with respect to the size of the neural networks. We show that tanh neural networks with only two hidden layers suffice to approximate functions at comparable or better rates than much deeper ReLU neural networks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.