Out-of-distribution Generalization in the Presence of Nuisance-Induced   Spurious Correlations

Aahlad Puli; Lily H. Zhang; Eric K. Oermann; Rajesh Ranganath

arXiv:2107.00520·cs.LG·February 14, 2023·5 cites

Out-of-distribution Generalization in the Presence of Nuisance-Induced Spurious Correlations

Aahlad Puli, Lily H. Zhang, Eric K. Oermann, Rajesh Ranganath

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces Nuisance-Randomized Distillation (NURD), a method for training models that maintain high performance across varying nuisance-label relationships by focusing on representations independent of nuisances.

Contribution

The paper proposes a novel approach, NURD, that finds representations invariant to nuisances, ensuring robust out-of-distribution generalization in the presence of spurious correlations.

Findings

01

NURD outperforms baseline models in chest X-ray classification with spurious background correlations.

02

NURD's representations are independent of nuisance variables under the nuisance-randomized distribution.

03

Models trained with NURD maintain high accuracy across different nuisance-label relationships.

Abstract

In many prediction problems, spurious correlations are induced by a changing relationship between the label and a nuisance variable that is also correlated with the covariates. For example, in classifying animals in natural images, the background, which is a nuisance, can predict the type of animal. This nuisance-label relationship does not always hold, and the performance of a model trained under one such relationship may be poor on data with a different nuisance-label relationship. To build predictive models that perform well regardless of the nuisance-label relationship, we develop Nuisance-Randomized Distillation (NURD). We introduce the nuisance-randomized distribution, a distribution where the nuisance and the label are independent. Under this distribution, we define the set of representations such that conditioning on any member, the nuisance and the label remain independent. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rajesh-lab/nurd-code-public
pytorchOfficial

Videos

Out-of-distribution Generalization in the Presence of Nuisance-Induced Spurious Correlations· slideslive

Taxonomy

TopicsCOVID-19 diagnosis using AI · Anomaly Detection Techniques and Applications · Machine Learning and Data Classification