Robust supervised learning with coordinate gradient descent
St\'ephane Ga\"iffas, Ibrahim Merad

TL;DR
This paper introduces a robust supervised learning method using coordinate gradient descent with robust partial derivative estimators, effectively handling corrupted data with minimal additional computational cost.
Contribution
It proposes a novel combination of coordinate gradient descent and robust estimators for partial derivatives, achieving robustness with nearly the same complexity as standard methods.
Findings
Provides theoretical bounds on generalization error.
Demonstrates competitive performance in numerical experiments.
Offers an efficient Python implementation in linlearn.
Abstract
This paper considers the problem of supervised learning with linear methods when both features and labels can be corrupted, either in the form of heavy tailed data and/or corrupted rows. We introduce a combination of coordinate gradient descent as a learning algorithm together with robust estimators of the partial derivatives. This leads to robust statistical learning methods that have a numerical complexity nearly identical to non-robust ones based on empirical risk minimization. The main idea is simple: while robust learning with gradient descent requires the computational cost of robustly estimating the whole gradient to update all parameters, a parameter can be updated immediately using a robust estimator of a single partial derivative in coordinate gradient descent. We prove upper bounds on the generalization error of the algorithms derived from this idea, that control both the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Statistical Methods and Inference · Machine Learning and Algorithms
