Deep Network Approximation in Terms of Intrinsic Parameters

Zuowei Shen; Haizhao Yang; Shijun Zhang

arXiv:2111.07964·cs.LG·September 15, 2022·1 cites

Deep Network Approximation in Terms of Intrinsic Parameters

Zuowei Shen, Haizhao Yang, Shijun Zhang

PDF

Open Access

TL;DR

This paper demonstrates that deep neural networks can approximate functions effectively with significantly fewer learnable parameters than traditionally thought, combining theoretical design and empirical validation.

Contribution

It introduces a method to construct ReLU networks with minimal intrinsic parameters that still achieve high approximation accuracy, supported by theoretical proofs and experiments.

Findings

01

ReLU networks with n+2 intrinsic parameters approximate Lipschitz functions exponentially well.

02

Small parameter subsets can be trained effectively for classification tasks.

03

Theoretical and empirical evidence supports learning with fewer parameters.

Abstract

One of the arguments to explain the success of deep learning is the powerful approximation capacity of deep neural networks. Such capacity is generally accompanied by the explosive growth of the number of parameters, which, in turn, leads to high computational costs. It is of great interest to ask whether we can achieve successful deep learning with a small number of learnable parameters adapting to the target function. From an approximation perspective, this paper shows that the number of parameters that need to be learned can be significantly smaller than people typically expect. First, we theoretically design ReLU networks with a few learnable parameters to achieve an attractive approximation. We prove by construction that, for any Lipschitz continuous function $f$ on $[0, 1]^{d}$ with a Lipschitz constant $λ > 0$ , a ReLU network with $n + 2$ intrinsic parameters (those depending on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Neural Networks and Applications · Machine Learning and Data Classification