Direct-Effect Risk Minimization for Domain Generalization

Yuhui Li; Zejia Wu; Chao Zhang; Hongyang Zhang

arXiv:2211.14594·cs.LG·March 17, 2023

Direct-Effect Risk Minimization for Domain Generalization

Yuhui Li, Zejia Wu, Chao Zhang, Hongyang Zhang

PDF

Open Access 1 Repo

TL;DR

This paper introduces a causal inference-based method to improve out-of-distribution generalization by minimizing indirect effects, effectively handling correlation shifts across domains.

Contribution

The paper proposes a novel two-stage algorithm that isolates and removes indirect effects to enhance domain generalization, compatible with existing methods.

Findings

01

Improved performance on 5 correlation-shifted datasets

02

Enhanced generalization on DomainBed benchmark

03

Effective removal of indirect effects

Abstract

We study the problem of out-of-distribution (o.o.d.) generalization where spurious correlations of attributes vary across training and test domains. This is known as the problem of correlation shift and has posed concerns on the reliability of machine learning. In this work, we introduce the concepts of direct and indirect effects from causal inference to the domain generalization problem. We argue that models that learn direct effects minimize the worst-case risk across correlation-shifted domains. To eliminate the indirect effects, our algorithm consists of two stages: in the first stage, we learn an indirect-effect representation by minimizing the prediction error of domain labels using the representation and the class labels; in the second stage, we remove the indirect effects learned in the first stage by matching each data with another data of similar indirect-effect…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

liyuhui-12/drmfordg
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Topic Modeling · Multimodal Machine Learning Applications

MethodsTest