Sparse deep neural networks for nonparametric estimation in high-dimensional sparse regression
Dongya Wu, Xin Li

TL;DR
This paper develops a nonparametric approach to estimate partial derivatives in sparse deep neural networks, enabling variable selection and interpretability in high-dimensional settings, with proven convergence rates.
Contribution
It introduces a novel nonparametric estimation method for partial derivatives in deep neural networks, addressing unidentifiability issues and establishing convergence guarantees.
Findings
Sample complexity grows logarithmically with parameters and input dimension.
Convergence rate of partial derivative estimation is O(n^{-1/4}).
First integration of nonparametric estimation with sparse deep neural networks.
Abstract
Generalization theory has been established for sparse deep neural networks under high-dimensional regime. Beyond generalization, parameter estimation is also important since it is crucial for variable selection and interpretability of deep neural networks. Current theoretical studies concerning parameter estimation mainly focus on two-layer neural networks, which is due to the fact that the convergence of parameter estimation heavily relies on the regularity of the Hessian matrix, while the Hessian matrix of deep neural networks is highly singular. To avoid the unidentifiability of deep neural networks in parameter estimation, we propose to conduct nonparametric estimation of partial derivatives with respect to inputs. We first show that model convergence of sparse deep neural networks is guaranteed in that the sample complexity only grows with the logarithm of the number of parameters…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Gaussian Processes and Bayesian Inference · Sparse and Compressive Sensing Techniques
MethodsFocus
