LassoNet: A Neural Network with Feature Sparsity
Ismael Lemhadri, Feng Ruan, Louis Abraham, Robert Tibshirani

TL;DR
LassoNet introduces a neural network framework that enforces feature sparsity through a hierarchy constraint, enabling effective feature selection and regularization directly integrated into the learning process.
Contribution
It presents a novel hierarchical regularization method for neural networks that combines feature selection with parameter learning using a modified objective function.
Findings
Outperforms state-of-the-art feature selection methods
Provides a regularization path with varying feature sparsity
Easily integrated into standard neural network implementations
Abstract
Much work has been done recently to make neural networks more interpretable, and one obvious approach is to arrange for the network to use only a subset of the available features. In linear models, Lasso (or -regularized) regression assigns zero weights to the most irrelevant or redundant features, and is widely used in data science. However the Lasso only applies to linear models. Here we introduce LassoNet, a neural network framework with global feature selection. Our approach enforces a hierarchy: specifically a feature can participate in a hidden unit only if its linear representative is active. Unlike other approaches to feature selection for neural nets, our method uses a modified objective function with constraints, and so integrates feature selection with the parameter learning directly. As a result, it delivers an entire regularization path of solutions with a range of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Stochastic Gradient Optimization Techniques · Machine Learning and Algorithms
MethodsFeature Selection · Residual Connection
