Revisiting Spurious Correlation in Domain Generalization

Bin Qin; Jiangmeng Li; Yi Li; Xuesong Wu; Yupeng Wang; Wenwen Qiang,; Jianwen Cao

arXiv:2406.11517·cs.LG·June 18, 2024

Revisiting Spurious Correlation in Domain Generalization

Bin Qin, Jiangmeng Li, Yi Li, Xuesong Wu, Yupeng Wang, Wenwen Qiang,, Jianwen Cao

PDF

Open Access

TL;DR

This paper revisits the problem of spurious correlations in domain generalization, proposing a new SCM for representation learning and a propensity score weighted estimator to improve out-of-distribution generalization.

Contribution

It introduces a novel SCM tailored for representation learning and a plug-and-play bias control method for better OOD generalization.

Findings

01

Effective on synthetic datasets

02

Improves OOD performance on real datasets

03

Validates the proposed approach through extensive experiments

Abstract

Without loss of generality, existing machine learning techniques may learn spurious correlation dependent on the domain, which exacerbates the generalization of models in out-of-distribution (OOD) scenarios. To address this issue, recent works build a structural causal model (SCM) to describe the causality within data generation process, thereby motivating methods to avoid the learning of spurious correlation by models. However, from the machine learning viewpoint, such a theoretical analysis omits the nuanced difference between the data generation process and representation learning process, resulting in that the causal analysis based on the former cannot well adapt to the latter. To this end, we explore to build a SCM for representation learning process and further conduct a thorough analysis of the mechanisms underlying spurious correlation. We underscore that adjusting erroneous…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel-Driven Software Engineering Techniques · Manufacturing Process and Optimization