Heavy Lasso: sparse penalized regression under heavy-tailed noise via data-augmented soft-thresholding
The Tien Mai

TL;DR
Heavy Lasso is a robust high-dimensional regression method that uses a Student's t-inspired loss with Lasso penalty, effectively handling heavy-tailed noise and outliers with theoretical guarantees and efficient algorithms.
Contribution
It introduces a novel loss function combining robustness to heavy-tailed noise with Lasso regularization, supported by theoretical bounds and practical algorithms.
Findings
Outperforms classical Lasso in heavy-tailed noise scenarios
Achieves comparable rates to Huber loss with robust properties
Demonstrates superior empirical performance in simulations
Abstract
High-dimensional linear regression is a fundamental tool in modern statistics, particularly when the number of predictors exceeds the sample size. The classical Lasso, which relies on the squared loss, performs well under Gaussian noise assumptions but often deteriorates in the presence of heavy-tailed errors or outliers commonly encountered in real data applications such as genomics, finance, and signal processing. To address these challenges, we propose a novel robust regression method, termed Heavy Lasso, which incorporates a loss function inspired by the Student's t-distribution within a Lasso penalization framework. This loss retains the desirable quadratic behavior for small residuals while adaptively downweighting large deviations, thus enhancing robustness to heavy-tailed noise and outliers. Heavy Lasso enjoys computationally efficient by leveraging a data augmentation scheme…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Sparse and Compressive Sensing Techniques · Machine Learning and Algorithms
MethodsLinear Regression · Huber loss
