Effective Robustness against Natural Distribution Shifts for Models with   Different Training Data

Zhouxing Shi; Nicholas Carlini; Ananth Balashankar; Ludwig Schmidt,; Cho-Jui Hsieh; Alex Beutel; Yao Qin

arXiv:2302.01381·cs.LG·October 31, 2023

Effective Robustness against Natural Distribution Shifts for Models with Different Training Data

Zhouxing Shi, Nicholas Carlini, Ananth Balashankar, Ludwig Schmidt,, Cho-Jui Hsieh, Alex Beutel, Yao Qin

PDF

Open Access 1 Video

TL;DR

This paper introduces a new evaluation metric for effective robustness that accounts for models trained on different data distributions, providing more accurate comparisons of out-of-distribution robustness.

Contribution

It proposes a novel evaluation method controlling for multiple in-distribution test sets, improving robustness assessment across diverse training data.

Findings

01

The new metric offers a better estimate of effective robustness across different training data.

02

It explains the diminished robustness gains of zero-shot models under the new evaluation.

03

Interactive visualizations are provided for better understanding of robustness comparisons.

Abstract

"Effective robustness" measures the extra out-of-distribution (OOD) robustness beyond what can be predicted from the in-distribution (ID) performance. Existing effective robustness evaluations typically use a single test set such as ImageNet to evaluate the ID accuracy. This becomes problematic when evaluating models trained on different data distributions, e.g., comparing models trained on ImageNet vs. zero-shot language-image pre-trained models trained on LAION. In this paper, we propose a new evaluation metric to evaluate and compare the effective robustness of models trained on different data. To do this, we control for the accuracy on multiple ID test sets that cover the training distributions for all the evaluated models. Our new evaluation metric provides a better estimate of effective robustness when there are models with different training data. It may also explain the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Effective Robustness against Natural Distribution Shifts for Models with Different Training Data· slideslive

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Multimodal Machine Learning Applications · Anomaly Detection Techniques and Applications