Regularized least squares learning with heavy-tailed noise is minimax optimal

Mattes Mollenhauer; Nicole Mücke; Dimitri Meunier; Arthur Gretton

arXiv:2505.14214 · cs.LG · November 7, 2025

TL;DR

This paper proves that regularized least squares in reproducing kernel Hilbert spaces achieves optimal convergence rates even with heavy-tailed noise, demonstrating robustness beyond traditional subexponential assumptions.

Contribution

It establishes minimax optimal excess risk bounds for ridge regression under heavy-tailed noise using a novel Fuk-Nagaev inequality approach.

Findings

1. Achieves convergence rates previously only known under subexponential noise
2. Demonstrates robustness of regularized least squares to heavy-tailed noise
3. Provides theoretical guarantees under standard eigenvalue decay conditions

Abstract

This paper examines the performance of ridge regression in reproducing kernel Hilbert spaces in the presence of noise that possesses only a finite number of higher moments. We establish excess risk bounds consisting of subgaussian and polynomial terms based on the well-known integral operator framework. The dominant subgaussian component makes it possible to achieve convergence rates that have previously been derived only under subexponential noise, a prevalent assumption in related work from the last two decades. These rates are optimal under standard eigenvalue decay conditions, demonstrating the asymptotic robustness of regularized least squares against heavy-tailed noise. Our derivations are based on a Fuk-Nagaev inequality for Hilbert-space-valued random variables.
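To make the setting concrete, the following is a minimal sketch of kernel ridge regression fitted to data corrupted by heavy-tailed noise. It is an illustration only, not the authors' experiment or code: the Gaussian kernel, the bandwidth, the sine target function, the Student-t noise with 3 degrees of freedom (finite variance but no finite third moment), the regularization parameter, and all function names are assumptions chosen for the example.

```python
import numpy as np


def gaussian_kernel(a, b, bandwidth=0.2):
    # Pairwise Gaussian kernel between 1-D sample arrays a and b (assumed kernel choice).
    diff = a[:, None] - b[None, :]
    return np.exp(-diff**2 / (2.0 * bandwidth**2))


def krr_fit(x, y, lam, bandwidth=0.2):
    # Regularized least squares in the RKHS: solve (K + n * lam * I) alpha = y.
    n = x.shape[0]
    K = gaussian_kernel(x, x, bandwidth)
    return np.linalg.solve(K + n * lam * np.eye(n), y)


def krr_predict(x_train, alpha, x_new, bandwidth=0.2):
    # Evaluate the fitted function f(x) = sum_i alpha_i k(x, x_i) at new points.
    return gaussian_kernel(x_new, x_train, bandwidth) @ alpha


def demo(n=400, df=3.0, lam=1e-3, seed=0):
    # Hypothetical setup: sine target on [0, 1] with scaled Student-t noise.
    rng = np.random.default_rng(seed)
    x = rng.uniform(0.0, 1.0, size=n)

    def f_true(t):
        return np.sin(2.0 * np.pi * t)

    noise = 0.3 * rng.standard_t(df, size=n)  # heavy-tailed: only moments below df are finite
    y = f_true(x) + noise

    alpha = krr_fit(x, y, lam)
    grid = np.linspace(0.0, 1.0, 200)
    # Mean squared distance to the true regression function on a grid,
    # used here as a simple stand-in for the excess risk.
    return float(np.mean((krr_predict(x, alpha, grid) - f_true(grid)) ** 2))


if __name__ == "__main__":
    print("grid error:", demo())
```

Rerunning `demo` with increasing `n` and a suitably decaying `lam` gives an informal feel for how the error shrinks despite the heavy tails; the rates themselves are the subject of the paper's bounds and are not reproduced by this sketch.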

Videos

Regularized least squares learning with heavy-tailed noise is minimax optimal (SlidesLive)

Taxonomy

Topics: Statistical Methods and Inference · Stochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques