Sparse-Input Neural Networks for High-dimensional Nonparametric   Regression and Classification

Jean Feng; Noah Simon

arXiv:1711.07592·stat.ME·June 25, 2019·56 cites

Sparse-Input Neural Networks for High-dimensional Nonparametric Regression and Classification

Jean Feng, Noah Simon

PDF

Open Access 1 Repo

TL;DR

This paper introduces a sparse-input neural network approach with regularization that effectively handles high-dimensional nonparametric regression and classification by selecting relevant features and achieving favorable convergence properties.

Contribution

The authors propose a novel neural network method with a sparse group lasso penalty on input weights, enabling feature selection and improved performance in high-dimensional settings.

Findings

01

Sparse-input neural networks outperform existing methods in complex high-dimensional data.

02

The method's excess risk grows only logarithmically with the number of features.

03

Irrelevant feature weights converge to zero, ensuring effective feature selection.

Abstract

Neural networks are usually not the tool of choice for nonparametric high-dimensional problems where the number of input features is much larger than the number of observations. Though neural networks can approximate complex multivariate functions, they generally require a large number of training observations to obtain reasonable fits, unless one can learn the appropriate network structure. In this manuscript, we show that neural networks can be applied successfully to high-dimensional settings if the true function falls in a low dimensional subspace, and proper regularization is used. We propose fitting a neural network with a sparse group lasso penalty on the first-layer input weights. This results in a neural net that only uses a small subset of the original features. In addition, we characterize the statistical convergence of the penalized empirical risk minimizer to the optimal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jjfeng/spinn
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Face and Expression Recognition · Anomaly Detection Techniques and Applications