Wasserstein Distributionally Robust Estimation in High Dimensions: Performance Analysis and Optimal Hyperparameter Tuning
Liviu Aolaritei, Soroosh Shafiee, Florian D\"orfler

TL;DR
This paper develops a high-dimensional distributionally robust linear regression method that precisely characterizes estimation error and enables efficient optimal hyperparameter tuning, matching cross-validation performance with less computational effort.
Contribution
It introduces a novel high-dimensional analysis of DRO-based linear regression, providing a simple convex optimization for optimal robustness radius selection.
Findings
Estimation error can be characterized by a convex-concave problem involving four scalar variables.
Optimal robustness radius minimizes estimation error and matches cross-validation results.
The proposed method achieves comparable performance to cross-validation with significantly reduced computational cost.
Abstract
Distributionally robust optimization (DRO) has become a powerful framework for estimation under uncertainty, offering strong out-of-sample performance and principled regularization. In this paper, we propose a DRO-based method for linear regression and address a central question: how to optimally choose the robustness radius, which controls the trade-off between robustness and accuracy. Focusing on high-dimensional settings where the dimension and the number of samples are both large and comparable in size, we employ tools from high-dimensional asymptotic statistics to precisely characterize the estimation error of the resulting estimator. Remarkably, this error can be recovered by solving a simple convex-concave optimization problem involving only four scalar variables. This characterization enables efficient selection of the radius that minimizes the estimation error. In doing so, it…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRisk and Portfolio Optimization · Statistical Methods and Inference
