Minimax Regret Optimization for Robust Machine Learning under Distribution Shift
Alekh Agarwal, Tong Zhang

TL;DR
This paper introduces Minimax Regret Optimization (MRO), a new approach for robust machine learning that aims to achieve uniformly low regret under distribution shifts, addressing limitations of existing distributionally robust methods.
Contribution
The paper proposes MRO as an alternative to DRO, providing theoretical guarantees for low regret across all potential test distributions under distribution shift.
Findings
MRO achieves uniformly low regret under distribution shift.
DRO does not guarantee low regret uniformly across test distributions.
MRO adapts to heterogeneous test distributions for stronger guarantees.
Abstract
In this paper, we consider learning scenarios where the learned model is evaluated under an unknown test distribution which potentially differs from the training distribution (i.e. distribution shift). The learner has access to a family of weight functions such that the test distribution is a reweighting of the training distribution under one of these functions, a setting typically studied under the name of Distributionally Robust Optimization (DRO). We consider the problem of deriving regret bounds in the classical learning theory setting, and require that the resulting regret bounds hold uniformly for all potential test distributions. We show that the DRO formulation does not guarantee uniformly small regret under distribution shift. We instead propose an alternative method called Minimax Regret Optimization (MRO), and show that under suitable conditions this method achieves uniformly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Distributed Sensor Networks and Detection Algorithms
