Revisiting Spurious Correlation in Domain Generalization
Bin Qin, Jiangmeng Li, Yi Li, Xuesong Wu, Yupeng Wang, Wenwen Qiang,, Jianwen Cao

TL;DR
This paper revisits the problem of spurious correlations in domain generalization, proposing a new SCM for representation learning and a propensity score weighted estimator to improve out-of-distribution generalization.
Contribution
It introduces a novel SCM tailored for representation learning and a plug-and-play bias control method for better OOD generalization.
Findings
Effective on synthetic datasets
Improves OOD performance on real datasets
Validates the proposed approach through extensive experiments
Abstract
Without loss of generality, existing machine learning techniques may learn spurious correlation dependent on the domain, which exacerbates the generalization of models in out-of-distribution (OOD) scenarios. To address this issue, recent works build a structural causal model (SCM) to describe the causality within data generation process, thereby motivating methods to avoid the learning of spurious correlation by models. However, from the machine learning viewpoint, such a theoretical analysis omits the nuanced difference between the data generation process and representation learning process, resulting in that the causal analysis based on the former cannot well adapt to the latter. To this end, we explore to build a SCM for representation learning process and further conduct a thorough analysis of the mechanisms underlying spurious correlation. We underscore that adjusting erroneous…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel-Driven Software Engineering Techniques · Manufacturing Process and Optimization
