Invariant Risk Minimization

Martin Arjovsky; L\'eon Bottou; Ishaan Gulrajani; David Lopez-Paz

arXiv:1907.02893·stat.ML·March 31, 2020·321 cites

Invariant Risk Minimization

Martin Arjovsky, L\'eon Bottou, Ishaan Gulrajani, David Lopez-Paz

PDF

Open Access 5 Repos

TL;DR

Invariant Risk Minimization (IRM) is a new learning approach that aims to find data representations with stable, invariant correlations across different training environments, improving out-of-distribution generalization by capturing causal structures.

Contribution

IRM introduces a novel framework for learning invariant representations that align with causal factors, advancing out-of-distribution robustness in machine learning models.

Findings

01

IRM learns representations with invariant correlations across distributions.

02

Theoretical analysis links IRM invariances to causal structures.

03

Experiments demonstrate IRM's improved out-of-distribution generalization.

Abstract

We introduce Invariant Risk Minimization (IRM), a learning paradigm to estimate invariant correlations across multiple training distributions. To achieve this goal, IRM learns a data representation such that the optimal classifier, on top of that data representation, matches for all training distributions. Through theory and experiments, we show how the invariances learned by IRM relate to the causal structures governing the data and enable out-of-distribution generalization.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Machine Learning and Algorithms