Out-of-distribution Generalization with Causal Invariant Transformations
Ruoyu Wang, Mingyang Yi, Zhitang Chen, Shengyu Zhu

TL;DR
This paper introduces a method for out-of-distribution generalization that leverages transformations invariant to causal features, enabling models to perform well across diverse domains without explicitly recovering causal features.
Contribution
The work proposes a novel approach using causal invariant transformations to improve OOD generalization, relaxing previous assumptions and providing theoretical guarantees.
Findings
Theoretical proof of minimax optimality with invariant transformations.
Effective regularized training procedure for OOD generalization.
Experimental validation on synthetic and real datasets showing improved performance.
Abstract
In real-world applications, it is important and desirable to learn a model that performs well on out-of-distribution (OOD) data. Recently, causality has become a powerful tool to tackle the OOD generalization problem, with the idea resting on the causal mechanism that is invariant across domains of interest. To leverage the generally unknown causal mechanism, existing works assume a linear form of causal feature or require sufficiently many and diverse training domains, which are usually restrictive in practice. In this work, we obviate these assumptions and tackle the OOD problem without explicitly recovering the causal feature. Our approach is based on transformations that modify the non-causal feature but leave the causal part unchanged, which can be either obtained from prior knowledge or learned from the training data in the multi-domain scenario. Under the setting of invariant…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Text and Document Classification Technologies
