Sufficient Invariant Learning for Distribution Shift
Taero Kim, Subeen Park, Sungjun Lim, Yonghan Jung, Krikamol Muandet, Kyungwoo Song

TL;DR
This paper introduces the Sufficient Invariant Learning framework and the ASGDRO algorithm to improve model robustness under distribution shifts by learning diverse, sufficient invariant features, addressing limitations of existing invariant learning methods.
Contribution
It proposes a novel SIL framework and ASGDRO algorithm that learn diverse invariant features by seeking common flat minima, enhancing robustness to distribution shifts.
Findings
ASGDRO outperforms existing methods on multiple datasets.
Learning diverse invariant features improves robustness.
Theoretical analysis confirms the benefit of common flat minima.
Abstract
Learning robust models under distribution shifts between training and test datasets is a fundamental challenge in machine learning. While learning invariant features across environments is a popular approach, it often assumes that these features are fully observed in both training and test sets, a condition frequently violated in practice. When models rely on invariant features absent in the test set, their robustness in new environments can deteriorate. To tackle this problem, we introduce a novel learning principle called the Sufficient Invariant Learning (SIL) framework, which focuses on learning a sufficient subset of invariant features rather than relying on a single feature. After demonstrating the limitation of existing invariant learning methods, we propose a new algorithm, Adaptive Sharpness-aware Group Distributionally Robust Optimization (ASGDRO), to learn diverse invariant…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning
MethodsTest · Sharpness-Aware Minimization
