Sufficient Invariant Learning for Distribution Shift

Taero Kim; Subeen Park; Sungjun Lim; Yonghan Jung; Krikamol Muandet; Kyungwoo Song

arXiv:2210.13533·cs.LG·September 15, 2025·1 cites

Sufficient Invariant Learning for Distribution Shift

Taero Kim, Subeen Park, Sungjun Lim, Yonghan Jung, Krikamol Muandet, Kyungwoo Song

PDF

Open Access

TL;DR

This paper introduces the Sufficient Invariant Learning framework and the ASGDRO algorithm to improve model robustness under distribution shifts by learning diverse, sufficient invariant features, addressing limitations of existing invariant learning methods.

Contribution

It proposes a novel SIL framework and ASGDRO algorithm that learn diverse invariant features by seeking common flat minima, enhancing robustness to distribution shifts.

Findings

01

ASGDRO outperforms existing methods on multiple datasets.

02

Learning diverse invariant features improves robustness.

03

Theoretical analysis confirms the benefit of common flat minima.

Abstract

Learning robust models under distribution shifts between training and test datasets is a fundamental challenge in machine learning. While learning invariant features across environments is a popular approach, it often assumes that these features are fully observed in both training and test sets, a condition frequently violated in practice. When models rely on invariant features absent in the test set, their robustness in new environments can deteriorate. To tackle this problem, we introduce a novel learning principle called the Sufficient Invariant Learning (SIL) framework, which focuses on learning a sufficient subset of invariant features rather than relying on a single feature. After demonstrating the limitation of existing invariant learning methods, we propose a new algorithm, Adaptive Sharpness-aware Group Distributionally Robust Optimization (ASGDRO), to learn diverse invariant…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning

MethodsTest · Sharpness-Aware Minimization