On Invariance Penalties for Risk Minimization

Kia Khezeli; Arno Blaas; Frank Soboczenski; Nicholas Chia; John; Kalantari

arXiv:2106.09777·cs.LG·June 21, 2021·1 cites

On Invariance Penalties for Risk Minimization

Kia Khezeli, Arno Blaas, Frank Soboczenski, Nicholas Chia, John, Kalantari

PDF

Open Access

TL;DR

This paper critiques the original invariance penalty in IRM, proposes a new penalty based on the Gramian matrix, and demonstrates improved invariance recovery in linear settings through experiments.

Contribution

It introduces an alternative invariance penalty using the Gramian matrix, addressing limitations of the original IRM approach.

Findings

01

The new penalty recovers invariant representations in linear models.

02

It outperforms the original IRM penalty on domain generalization benchmarks.

03

The approach is effective under mild non-degeneracy conditions.

Abstract

The Invariant Risk Minimization (IRM) principle was first proposed by Arjovsky et al. [2019] to address the domain generalization problem by leveraging data heterogeneity from differing experimental conditions. Specifically, IRM seeks to find a data representation under which an optimal classifier remains invariant across all domains. Despite the conceptual appeal of IRM, the effectiveness of the originally proposed invariance penalty has recently been brought into question. In particular, there exists counterexamples for which that invariance penalty can be arbitrarily small for non-invariant data representations. We propose an alternative invariance penalty by revisiting the Gramian matrix of the data representation. We discuss the role of its eigenvalues in the relationship between the risk and the invariance penalty, and demonstrate that it is ill-conditioned for said…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Topic Modeling