Instrumental Variable-Driven Domain Generalization with Unobserved Confounders
Junkun Yuan, Xu Ma, Ruoxuan Xiong, Mingming Gong, Xiangyu Liu, Fei Wu,, Lanfen Lin, Kun Kuang

TL;DR
This paper introduces IV-DG, a novel domain generalization approach leveraging instrumental variables to address unobserved confounders, improving invariant relationship learning across unseen domains with theoretical and empirical validation.
Contribution
The paper proposes a causal, instrumental variable-based method for domain generalization that effectively removes bias from unobserved confounders, a novel approach in this context.
Findings
Achieves state-of-the-art results on real-world datasets.
Accurately captures invariant relationships across domains.
Theoretically validated through analysis and simulations.
Abstract
Domain generalization (DG) aims to learn from multiple source domains a model that can generalize well on unseen target domains. Existing DG methods mainly learn the representations with invariant marginal distribution of the input features, however, the invariance of the conditional distribution of the labels given the input features is more essential for unknown domain prediction. Meanwhile, the existing of unobserved confounders which affect the input features and labels simultaneously cause spurious correlation and hinder the learning of the invariant relationship contained in the conditional distribution. Interestingly, with a causal view on the data generating process, we find that the input features of one domain are valid instrumental variables for other domains. Inspired by this finding, we propose an instrumental variable-driven DG method (IV-DG) by removing the bias of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Cancer-related molecular mechanisms research
