Bitrate-Constrained DRO: Beyond Worst Case Robustness To Unknown Group Shifts
Amrith Setlur, Don Dennis, Benjamin Eysenbach, Aditi Raghunathan,, Chelsea Finn, Virginia Smith, Sergey Levine

TL;DR
This paper introduces BR-DRO, a new robust training method that handles unknown group shifts by constraining the adversary's capacity, improving robustness without needing group labels.
Contribution
BR-DRO is a practical algorithm that does not require group annotations and outperforms existing methods in handling distribution shifts with simple group structures.
Findings
Matches Group DRO performance with group labels
Outperforms CVaR DRO on long-tailed distributions
Provably yields less conservative solutions
Abstract
Training machine learning models robust to distribution shifts is critical for real-world applications. Some robust training algorithms (e.g., Group DRO) specialize to group shifts and require group information on all training points. Other methods (e.g., CVaR DRO) that do not need group annotations can be overly conservative, since they naively upweight high loss points which may form a contrived set that does not correspond to any meaningful group in the real world (e.g., when the high loss points are randomly mislabeled training points). In this work, we address limitations in prior approaches by assuming a more nuanced form of group shift: conditioned on the label, we assume that the true group function (indicator over group) is simple. For example, we may expect that group shifts occur along low bitrate features (e.g., image background, lighting). Thus, we aim to learn a model that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Machine Learning and Data Classification · Domain Adaptation and Few-Shot Learning
