Direct-Effect Risk Minimization for Domain Generalization
Yuhui Li, Zejia Wu, Chao Zhang, Hongyang Zhang

TL;DR
This paper introduces a causal inference-based method to improve out-of-distribution generalization by minimizing indirect effects, effectively handling correlation shifts across domains.
Contribution
The paper proposes a novel two-stage algorithm that isolates and removes indirect effects to enhance domain generalization, compatible with existing methods.
Findings
Improved performance on 5 correlation-shifted datasets
Enhanced generalization on DomainBed benchmark
Effective removal of indirect effects
Abstract
We study the problem of out-of-distribution (o.o.d.) generalization where spurious correlations of attributes vary across training and test domains. This is known as the problem of correlation shift and has posed concerns on the reliability of machine learning. In this work, we introduce the concepts of direct and indirect effects from causal inference to the domain generalization problem. We argue that models that learn direct effects minimize the worst-case risk across correlation-shifted domains. To eliminate the indirect effects, our algorithm consists of two stages: in the first stage, we learn an indirect-effect representation by minimizing the prediction error of domain labels using the representation and the class labels; in the second stage, we remove the indirect effects learned in the first stage by matching each data with another data of similar indirect-effect…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Topic Modeling · Multimodal Machine Learning Applications
MethodsTest
