Achievable distributional robustness when the robust risk is only   partially identified

Julia Kostin; Nicola Gnecco; Fanny Yang

arXiv:2502.02710·stat.ML·February 6, 2025

Achievable distributional robustness when the robust risk is only partially identified

Julia Kostin, Nicola Gnecco, Fanny Yang

PDF

Open Access 1 Video

TL;DR

This paper explores the limits of distributional robustness in machine learning when the robust risk cannot be fully identified, proposing a new measure that improves generalization in partially identifiable scenarios.

Contribution

It introduces the worst-case robust risk as a new robustness measure applicable under partial identifiability, and demonstrates its advantages over existing methods in theory and real data.

Findings

01

Existing robustness methods are suboptimal under partial identifiability.

02

The proposed measure improves test error in gene expression data.

03

Partial identifiability allows for better robustness than traditional approaches.

Abstract

In safety-critical applications, machine learning models should generalize well under worst-case distribution shifts, that is, have a small robust risk. Invariance-based algorithms can provably take advantage of structural assumptions on the shifts when the training distributions are heterogeneous enough to identify the robust risk. However, in practice, such identifiability conditions are rarely satisfied -- a scenario so far underexplored in the theoretical literature. In this paper, we aim to fill the gap and propose to study the more general setting when the robust risk is only partially identifiable. In particular, we introduce the worst-case robust risk as a new measure of robustness that is always well-defined regardless of identifiability. Its minimum corresponds to an algorithm-independent (population) minimax quantity that measures the best achievable robustness under partial…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Achievable distributional robustness when the robust risk is only partially identified· slideslive

Taxonomy

TopicsRisk and Portfolio Optimization · Market Dynamics and Volatility