Lasso, knockoff and Gaussian covariates: a comparison
Laurie Davies

TL;DR
This paper compares lasso, knockoff, and Gaussian covariate methods for variable selection in linear regression, demonstrating that the Gaussian covariate approach is faster, simpler, and more accurate across simulations and real data.
Contribution
The paper introduces and validates a model-free Gaussian covariate method that outperforms existing techniques in variable selection tasks.
Findings
Gaussian covariate method is faster and simpler.
It provides more accurate variable selection.
It outperforms lasso and knockoff in all tested scenarios.
Abstract
Given data and covariates one problem in linear regression is to decide which if any of the covariates to include when regressing the dependent variable on the covariates . In this paper three such methods, lasso, knockoff and Gaussian covariates are compared using simulations and real data. The Gaussian covariate method is based on exact probabilities which are valid for all and making it model free. Moreover the probabilities agree with those based on the F-distribution for the standard linear model with i.i.d. Gaussian errors. It is conceptually, mathematically and algorithmically very simple, it is very fast and makes no use of simulations. It outperforms lasso and knockoff in all respects by a considerable margin.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Advanced Statistical Methods and Models · Bayesian Methods and Mixture Models
