Heavy Lasso: sparse penalized regression under heavy-tailed noise via data-augmented soft-thresholding

The Tien Mai

arXiv:2506.07790 · stat.ME · June 10, 2025 · Stat. Comput.

Open Access · 1 Repo

TL;DR

Heavy Lasso is a robust high-dimensional regression method that combines a Student's t-inspired loss with a Lasso penalty. It handles heavy-tailed noise and outliers, and it comes with theoretical guarantees and efficient algorithms.

Contribution

The paper introduces a loss function that is robust to heavy-tailed noise, pairs it with Lasso regularization, and supports the resulting estimator with theoretical bounds and practical algorithms.

Findings

01 Outperforms the classical Lasso under heavy-tailed noise

02 Achieves rates comparable to those of the Huber loss while remaining robust

03 Demonstrates superior empirical performance in simulations

Abstract

High-dimensional linear regression is a fundamental tool in modern statistics, particularly when the number of predictors exceeds the sample size. The classical Lasso, which relies on the squared loss, performs well under Gaussian noise assumptions but often deteriorates in the presence of heavy-tailed errors or outliers, which are common in real-data applications such as genomics, finance, and signal processing. To address these challenges, we propose a novel robust regression method, termed Heavy Lasso, which incorporates a loss function inspired by the Student's t-distribution within a Lasso penalization framework. This loss retains the desirable quadratic behavior for small residuals while adaptively downweighting large deviations, thus enhancing robustness to heavy-tailed noise and outliers. Heavy Lasso is computationally efficient, leveraging a data augmentation scheme…
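
The truncated abstract does not spell out the loss or the algorithm, so the following is only a minimal sketch of one way the described ingredients could fit together. It assumes the loss is the Student's t negative log-likelihood, rho(r) = (nu + 1)/2 * log(1 + r^2/nu), with unit scale and a fixed degrees-of-freedom parameter nu. It also assumes the data augmentation step is the standard scale-mixture reweighting w_i = (nu + 1)/(nu + r_i^2), followed by coordinate-wise soft-thresholding on the resulting weighted Lasso problem. The function names (`soft_threshold`, `t_loss`, `objective`, `heavy_lasso_sketch`) and all default values are hypothetical and are not taken from the paper or from the tienmt/heavylasso repository.

```python
import numpy as np


def soft_threshold(z, t):
    """Soft-thresholding operator: sign(z) * max(|z| - t, 0)."""
    return np.sign(z) * np.maximum(np.abs(z) - t, 0.0)


def t_loss(r, nu):
    """Assumed Student's t-type loss, roughly quadratic for small residuals."""
    return 0.5 * (nu + 1.0) * np.log1p(r ** 2 / nu)


def objective(X, y, beta, lam, nu=4.0):
    """Mean assumed t-type loss plus the L1 penalty."""
    return np.mean(t_loss(y - X @ beta, nu)) + lam * np.sum(np.abs(beta))


def heavy_lasso_sketch(X, y, lam, nu=4.0, n_outer=50, n_inner=20):
    """Reweighting plus soft-thresholding sketch (an assumption, not the paper's code)."""
    n, p = X.shape
    beta = np.zeros(p)
    for _ in range(n_outer):
        resid = y - X @ beta
        # Augmentation-style weights: observations with large residuals count less.
        w = (nu + 1.0) / (nu + resid ** 2)
        # Coordinate descent on (1/(2n)) * sum_i w_i * r_i^2 + lam * ||beta||_1.
        for _ in range(n_inner):
            for j in range(p):
                xj = X[:, j]
                den = np.sum(w * xj ** 2) / n
                if den == 0.0:
                    continue
                # Partial residual with coordinate j removed from the fit.
                num = np.sum(w * xj * (resid + xj * beta[j])) / n
                new_bj = soft_threshold(num, lam) / den
                resid += xj * (beta[j] - new_bj)
                beta[j] = new_bj
    return beta
```

Under these assumptions, a call such as `heavy_lasso_sketch(X, y, lam=0.1)` returns a coefficient vector of length `X.shape[1]`. Because the weighted quadratic majorizes the assumed loss at the current iterate, each outer pass cannot increase `objective`, which gives a simple sanity check when experimenting with `lam` and `nu`.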

Code & Models

Repositories

tienmt/heavylasso
Official

Taxonomy

Topics: Statistical Methods and Inference · Sparse and Compressive Sensing Techniques · Machine Learning and Algorithms

Methods: Linear Regression · Huber loss