Minimax Optimal Robust Sparse Regression with Heavy-Tailed Designs: A Gradient-Based Approach

Kaiyuan Zhou; Xiaoyu Zhang; Wenyang Zhang; and Di Wang

arXiv:2601.05669·stat.ME·January 12, 2026

Minimax Optimal Robust Sparse Regression with Heavy-Tailed Designs: A Gradient-Based Approach

Kaiyuan Zhou, Xiaoyu Zhang, Wenyang Zhang, and Di Wang

PDF

Open Access

TL;DR

This paper introduces a gradient-based method called RIGHT for robust sparse regression in high dimensions with heavy-tailed noise and designs, revealing fundamental error and sample complexity limits, and achieving optimal rates.

Contribution

The paper proposes a unified robust gradient descent framework that handles heavy-tailed data without higher-order moments, and characterizes the fundamental limits of estimation accuracy and sample complexity.

Findings

01

RIGHT achieves minimax optimal rates in heavy-tailed regimes.

02

In linear regression, error depends on noise tail index, sample complexity on design tail index.

03

In logistic regression, bounded gradients naturally provide robustness.

Abstract

We investigate high-dimensional sparse regression when both the noise and the design matrix exhibit heavy-tailed behavior. Standard algorithms typically fail in this regime, as heavy-tailed covariates distort the empirical risk geometry. We propose a unified framework, Robust Iterative Gradient descent with Hard Thresholding (RIGHT), which employs a robust gradient estimator to bypass the need for higher-order moment conditions. Our analysis reveals a fundamental decoupling phenomenon: in linear regression, the estimation error rate is governed by the noise tail index, while the sample complexity required for stability is governed by the design tail index. This implies that while heavy-tailed noise limits precision, heavy-tailed designs primarily raise the sample size barrier for convergence. In contrast, for logistic regression, we show that the bounded gradient naturally robustifies…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Statistical Methods and Inference · Sparse and Compressive Sensing Techniques