Semi-Analytic Resampling in Lasso

Tomoyuki Obuchi; Yoshiyuki Kabashima

arXiv:1802.10254·stat.ML·December 11, 2018·J. Mach. Learn. Res.

Semi-Analytic Resampling in Lasso

Tomoyuki Obuchi, Yoshiyuki Kabashima

PDF

Open Access 1 Repo

TL;DR

This paper introduces a semi-analytic resampling method for Lasso that reduces computational cost and variance in variable selection, using message passing algorithms and state evolution analysis, validated on simulated and real datasets.

Contribution

It develops a novel semi-analytic resampling approach for Lasso that eliminates repeated sampling, improving efficiency and stability in variable selection tasks.

Findings

01

Method achieves accurate approximation of resampling averages.

02

Significant reduction in computational time for bootstrapped Lasso.

03

Effective variable relevance criterion demonstrated on real-world data.

Abstract

An approximate method for conducting resampling in Lasso, the $ℓ_{1}$ penalized linear regression, in a semi-analytic manner is developed, whereby the average over the resampled datasets is directly computed without repeated numerical sampling, thus enabling an inference free of the statistical fluctuations due to sampling finiteness, as well as a significant reduction of computational time. The proposed method is based on a message passing type algorithm, and its fast convergence is guaranteed by the state evolution analysis, when covariates are provided as zero-mean independently and identically distributed Gaussian random variables. It is employed to implement bootstrapped Lasso (Bolasso) and stability selection, both of which are variable selection methods using resampling in conjunction with Lasso, and resolves their disadvantage regarding computational cost. To examine…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

T-Obuchi/AMPR_lasso_matlab
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Bayesian Methods and Mixture Models · Financial Risk and Volatility Modeling