Understanding the Failure Modes of Out-of-Distribution Generalization

Vaishnavh Nagarajan; Anders Andreassen; Behnam Neyshabur

arXiv:2010.15775·cs.LG·September 10, 2024·50 cites

Understanding the Failure Modes of Out-of-Distribution Generalization

Vaishnavh Nagarajan, Anders Andreassen, Behnam Neyshabur

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper investigates why machine learning models fail to generalize out-of-distribution, identifying fundamental geometric and statistical failure modes through theoretical analysis and dataset modifications.

Contribution

It uncovers two fundamental failure modes in OOD generalization caused by spurious correlations, supported by theoretical analysis and experimental dataset modifications.

Findings

01

Identifies geometric and statistical failure modes in OOD generalization.

02

Demonstrates these failure modes can be isolated in neural network training.

03

Provides dataset modifications to study failure modes in practice.

Abstract

Empirical studies suggest that machine learning models often rely on features, such as the background, that may be spuriously correlated with the label only during training time, resulting in poor accuracy during test-time. In this work, we identify the fundamental factors that give rise to this behavior, by explaining why models fail this way {\em even} in easy-to-learn tasks where one would expect these models to succeed. In particular, through a theoretical study of gradient-descent-trained linear classifiers on some easy-to-learn tasks, we uncover two complementary failure modes. These modes arise from how spurious correlations induce two kinds of skews in the data: one geometric in nature, and another, statistical in nature. Finally, we construct natural modifications of image classification datasets to understand when these failure modes can arise in practice. We also design…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

google-research/OOD-failures
noneOfficial

Videos

Understanding the failure modes of out-of-distribution generalization· slideslive

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification