Fast Distributionally Robust Learning with Variance Reduced Min-Max Optimization

Yaodong Yu; Tianyi Lin; Eric Mazumdar; Michael I. Jordan

arXiv:2104.13326·cs.LG·January 26, 2022·6 cites

Open Access

TL;DR

This paper introduces scalable stochastic algorithms with variance reduction for Wasserstein distributionally robust supervised learning, significantly improving convergence and efficiency in large-scale problems.

Contribution

The paper develops and analyzes variance-reduced stochastic extra-gradient algorithms for Wasserstein DRSL, enabling faster convergence and practical scalability.

Findings

01 The proposed algorithms outperform existing methods on synthetic and real data.

02 Variance reduction accelerates stochastic min-max optimization.

03 The algorithms provably achieve faster convergence rates than prior approaches.

Abstract

Distributionally robust supervised learning (DRSL) is emerging as a key paradigm for building reliable machine learning systems for real-world applications, reflecting the need for classifiers and predictive models that are robust to the distribution shifts that arise from phenomena such as selection bias or nonstationarity. Existing algorithms for solving Wasserstein DRSL, one of the most popular DRSL frameworks and one based around robustness to perturbations in the Wasserstein distance, have serious limitations that restrict their use in large-scale problems: in particular, they involve solving complex subproblems and they fail to make use of stochastic gradients. We revisit Wasserstein DRSL through the lens of min-max optimization and derive scalable and efficiently implementable stochastic extra-gradient algorithms which provably achieve faster convergence rates than existing…
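To give a rough sense of the extra-gradient idea the abstract refers to, the sketch below compares plain gradient descent-ascent with an extra-gradient update on a toy saddle problem. This is not the paper's algorithm: the objective f(x, y) = x * y, the step size, the iteration count, and all function names are illustrative assumptions, and the sketch is deterministic, so it omits both stochastic gradients and variance reduction.

```python
import numpy as np

def saddle_operator(z):
    # Toy objective f(x, y) = x * y (an assumption for illustration only):
    # minimize over x, maximize over y. Returns (df/dx, -df/dy).
    x, y = z
    return np.array([y, -x])

def gradient_descent_ascent(z0, step, iters):
    # Simultaneous descent in x and ascent in y using the current gradient.
    z = np.array(z0, dtype=float)
    for _ in range(iters):
        z = z - step * saddle_operator(z)
    return z

def extragradient(z0, step, iters):
    # Extra-gradient: take a look-ahead step, then update the original
    # point using the gradient evaluated at the look-ahead point.
    z = np.array(z0, dtype=float)
    for _ in range(iters):
        z_half = z - step * saddle_operator(z)
        z = z - step * saddle_operator(z_half)
    return z

if __name__ == "__main__":
    start = [1.0, 1.0]
    print("GDA distance to saddle:", np.linalg.norm(gradient_descent_ascent(start, 0.1, 2000)))
    print("EG  distance to saddle:", np.linalg.norm(extragradient(start, 0.1, 2000)))
```

On this bilinear problem the plain update spirals away from the saddle point at the origin, while the extra-gradient update contracts toward it, which is one common motivation for using extra-gradient steps in min-max optimization.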

Taxonomy

Topics: Risk and Portfolio Optimization · Sparse and Compressive Sensing Techniques · Statistical Methods and Inference