Deep Network Approximation in Terms of Intrinsic Parameters
Zuowei Shen, Haizhao Yang, Shijun Zhang

TL;DR
This paper demonstrates that deep neural networks can approximate functions effectively with significantly fewer learnable parameters than traditionally thought, combining theoretical design and empirical validation.
Contribution
It introduces a method to construct ReLU networks with minimal intrinsic parameters that still achieve high approximation accuracy, supported by theoretical proofs and experiments.
Findings
ReLU networks with n+2 intrinsic parameters approximate Lipschitz functions exponentially well.
Small parameter subsets can be trained effectively for classification tasks.
Theoretical and empirical evidence supports learning with fewer parameters.
Abstract
One of the arguments to explain the success of deep learning is the powerful approximation capacity of deep neural networks. Such capacity is generally accompanied by the explosive growth of the number of parameters, which, in turn, leads to high computational costs. It is of great interest to ask whether we can achieve successful deep learning with a small number of learnable parameters adapting to the target function. From an approximation perspective, this paper shows that the number of parameters that need to be learned can be significantly smaller than people typically expect. First, we theoretically design ReLU networks with a few learnable parameters to achieve an attractive approximation. We prove by construction that, for any Lipschitz continuous function on with a Lipschitz constant , a ReLU network with intrinsic parameters (those depending on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Neural Networks and Applications · Machine Learning and Data Classification
