Achievable distributional robustness when the robust risk is only partially identified
Julia Kostin, Nicola Gnecco, Fanny Yang

TL;DR
This paper explores the limits of distributional robustness in machine learning when the robust risk cannot be fully identified, proposing a new measure that improves generalization in partially identifiable scenarios.
Contribution
It introduces the worst-case robust risk as a new robustness measure applicable under partial identifiability, and demonstrates its advantages over existing methods in theory and real data.
Findings
Existing robustness methods are suboptimal under partial identifiability.
The proposed measure improves test error in gene expression data.
Partial identifiability allows for better robustness than traditional approaches.
Abstract
In safety-critical applications, machine learning models should generalize well under worst-case distribution shifts, that is, have a small robust risk. Invariance-based algorithms can provably take advantage of structural assumptions on the shifts when the training distributions are heterogeneous enough to identify the robust risk. However, in practice, such identifiability conditions are rarely satisfied -- a scenario so far underexplored in the theoretical literature. In this paper, we aim to fill the gap and propose to study the more general setting when the robust risk is only partially identifiable. In particular, we introduce the worst-case robust risk as a new measure of robustness that is always well-defined regardless of identifiability. Its minimum corresponds to an algorithm-independent (population) minimax quantity that measures the best achievable robustness under partial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsRisk and Portfolio Optimization · Market Dynamics and Volatility
