Sparsity Oriented Importance Learning for High-dimensional Linear Regression
Chenglong Ye, Yi Yang, Yuhong Yang

TL;DR
This paper introduces SOIL, a new variable importance measure for high-dimensional linear regression that accounts for model selection uncertainty, improving reliability in identifying true variables.
Contribution
The paper proposes SOIL, a theoretically justified importance measure that effectively distinguishes true variables from noise in high-dimensional sparse linear models.
Findings
SOIL effectively separates true variables from noise.
It maintains accuracy even with weak signals.
Outperforms existing importance measures in simulations and real data.
Abstract
With now well-recognized non-negligible model selection uncertainty, data analysts should no longer be satisfied with the output of a single final model from a model selection process, regardless of its sophistication. To improve reliability and reproducibility in model choice, one constructive approach is to make good use of a sound variable importance measure. Although interesting importance measures are available and increasingly used in data analysis, little theoretical justification has been done. In this paper, we propose a new variable importance measure, sparsity oriented importance learning (SOIL), for high-dimensional regression from a sparse linear modeling perspective by taking into account the variable selection uncertainty via the use of a sensible model weighting. The SOIL method is theoretically shown to have the inclusion/exclusion property: When the model weights are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Probabilistic and Robust Engineering Design · Statistical Methods and Bayesian Inference
