A Robust Learning Algorithm for Regression Models Using Distributionally Robust Optimization under the Wasserstein Metric
Ruidi Chen, Ioannis Ch. Paschalidis

TL;DR
This paper introduces a distributionally robust optimization method for linear regression that effectively handles adversarial outliers by considering a Wasserstein metric-based distribution set, leading to improved prediction and estimation accuracy.
Contribution
It develops a convex DRO framework that generalizes regularized regression models and provides theoretical guarantees and practical guidance for robust regression under outlier contamination.
Findings
Outperforms traditional regression models in prediction accuracy.
Achieves higher AUC in outlier detection tasks.
Provides theoretical bounds on out-of-sample and estimation errors.
Abstract
We present a Distributionally Robust Optimization (DRO) approach to estimate a robustified regression plane in a linear regression setting, when the observed samples are potentially contaminated with adversarially corrupted outliers. Our approach mitigates the impact of outliers through hedging against a family of distributions on the observed data, some of which assign very low probabilities to the outliers. The set of distributions under consideration are close to the empirical distribution in the sense of the Wasserstein metric. We show that this DRO formulation can be relaxed to a convex optimization problem which encompasses a class of models. By selecting proper norm spaces for the Wasserstein metric, we are able to recover several commonly used regularized regression models. We provide new insights into the regularization term and give guidance on the selection of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRisk and Portfolio Optimization · Statistical Methods and Inference · Advanced Statistical Methods and Models
MethodsLinear Regression
