Robust Methods for High-Dimensional Linear Learning

Ibrahim Merad; St\'ephane Ga\"iffas

arXiv:2208.05447·stat.ML·May 30, 2023

Robust Methods for High-Dimensional Linear Learning

Ibrahim Merad, St\'ephane Ga\"iffas

PDF

Open Access

TL;DR

This paper introduces robust, efficient linear learning algorithms for high-dimensional settings with heavy-tailed data and outliers, achieving near-optimal rates across various applications.

Contribution

It develops a generic framework with two algorithms for different loss functions, applicable to sparse, group-sparse, and low-rank recovery, with open-source implementation.

Findings

01

Algorithms reach near-optimal estimation rates under heavy tails and outliers.

02

Implementation in Python library 'linlearn' confirms theoretical results.

03

Comparable computational cost to non-robust methods.

Abstract

We propose statistically robust and computationally efficient linear learning methods in the high-dimensional batch setting, where the number of features $d$ may exceed the sample size $n$ . We employ, in a generic learning setting, two algorithms depending on whether the considered loss function is gradient-Lipschitz or not. Then, we instantiate our framework on several applications including vanilla sparse, group-sparse and low-rank matrix recovery. This leads, for each application, to efficient and robust learning algorithms, that reach near-optimal estimation rates under heavy-tailed distributions and the presence of outliers. For vanilla $s$ -sparsity, we are able to reach the $s lo g (d) / n$ rate under heavy-tails and $η$ -corruption, at a computational cost comparable to that of non-robust analogs. We provide an efficient implementation of our algorithms in an open-source…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Machine Learning and Algorithms · Domain Adaptation and Few-Shot Learning

MethodsLib