The phase diagram of approximation rates for deep neural networks
Dmitry Yarotsky, Anton Zhevnerchuk

TL;DR
This paper investigates the approximation capabilities of deep neural networks, revealing phase transitions in their approximation rates, and demonstrates how different architectures and activations influence these rates.
Contribution
It generalizes phase diagram results to various smoothness classes, compares activation functions, and introduces nearly exponential approximation rates for deep Fourier networks.
Findings
Deep discontinuous phase exists for arbitrary smoothness classes.
Networks with piecewise polynomial activations share the same phase diagram.
Deep Fourier networks achieve nearly exponential approximation rates.
Abstract
We explore the phase diagram of approximation rates for deep neural networks and prove several new theoretical results. In particular, we generalize the existing result on the existence of deep discontinuous phase in ReLU networks to functional classes of arbitrary positive smoothness, and identify the boundary between the feasible and infeasible rates. Moreover, we show that all networks with a piecewise polynomial activation function have the same phase diagram. Next, we demonstrate that standard fully-connected architectures with a fixed width independent of smoothness can adapt to smoothness and achieve almost optimal rates. Finally, we consider deep networks with periodic activations ("deep Fourier expansion") and prove that they have very fast, nearly exponential approximation rates, thanks to the emerging capability of the network to implement efficient lookup operations.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMachine Learning in Materials Science · Stochastic Gradient Optimization Techniques · Machine Learning and ELM
Methods*Communicated@Fast*How Do I Communicate to Expedia?
