Approximation Rates for Shallow ReLU$^k$ Neural Networks on Sobolev Spaces via the Radon Transform

Tong Mao; Jonathan W. Siegel; Jinchao Xu

arXiv:2408.10996·stat.ML·October 17, 2025

Approximation Rates for Shallow ReLU$^k$ Neural Networks on Sobolev Spaces via the Radon Transform

Tong Mao, Jonathan W. Siegel, Jinchao Xu

PDF

TL;DR

This paper establishes nearly optimal approximation rates for shallow ReLU$^k$ neural networks on Sobolev spaces using the Radon transform, extending previous results and highlighting their adaptivity for smoothness up to a certain order.

Contribution

It provides a simple proof of near-optimal approximation rates for shallow ReLU$^k$ networks on Sobolev spaces, generalizing existing results through the Radon transform and discrepancy theory.

Findings

01

Approximation rates are nearly optimal up to logarithmic factors.

02

Shallow ReLU$^k$ networks adaptively achieve optimal rates for smoothness up to s = k + (d+1)/2.

03

Results hold for various cases including q ≤ p, p ≥ 2, and s ≤ k + (d+1)/2.

Abstract

Let $Ω \subset R^{d}$ be a bounded domain. We consider the problem of how efficiently shallow neural networks with the ReLU $^{k}$ activation function can approximate functions from Sobolev spaces $W^{s} (L_{p} (Ω))$ with error measured in the $L_{q} (Ω)$ -norm. Utilizing the Radon transform and recent results from discrepancy theory, we provide a simple proof of nearly optimal approximation rates in a variety of cases, including when $q \leq p$ , $p \geq 2$ , and $s \leq k + (d + 1) /2$ . The rates we derive are optimal up to logarithmic factors, and significantly generalize existing results. An interesting consequence is that the adaptivity of shallow ReLU $^{k}$ neural networks enables them to obtain optimal approximation rates for smoothness up to order $s = k + (d + 1) /2$ , even though they represent piecewise polynomials of fixed degree $k$ .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.