Adaptivity of deep ReLU network for learning in Besov and mixed smooth Besov spaces: optimal rate and curse of dimensionality
Taiji Suzuki

TL;DR
This paper provides a theoretical analysis demonstrating that deep ReLU networks can adaptively learn functions in Besov spaces, achieving optimal rates and avoiding the curse of dimensionality, thus explaining their success in high-dimensional tasks.
Contribution
It develops a new approximation and estimation error analysis for deep ReLU networks in Besov spaces, showing their optimality and adaptivity compared to linear estimators.
Findings
Deep learning achieves minimax optimal rates in Besov spaces.
Deep networks outperform linear estimators like kernel ridge regression.
Deep learning can avoid the curse of dimensionality in mixed smooth Besov spaces.
Abstract
Deep learning has shown high performances in various types of tasks from visual recognition to natural language processing, which indicates superior flexibility and adaptivity of deep learning. To understand this phenomenon theoretically, we develop a new approximation and estimation error analysis of deep learning with the ReLU activation for functions in a Besov space and its variant with mixed smoothness. The Besov space is a considerably general function space including the Holder space and Sobolev space, and especially can capture spatial inhomogeneity of smoothness. Through the analysis in the Besov space, it is shown that deep learning can achieve the minimax optimal rate and outperform any non-adaptive (linear) estimator such as kernel ridge regression, which shows that deep learning has higher adaptivity to the spatial inhomogeneity of the target function than other estimators…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Signal Denoising Methods · Advanced Data Compression Techniques · Seismic Imaging and Inversion Techniques
Methods*Communicated@Fast*How Do I Communicate to Expedia?
