Learning Causal Semantic Representation for Out-of-Distribution   Prediction

Chang Liu; Xinwei Sun; Jindong Wang; Haoyue Tang; Tao Li; Tao Qin; Wei; Chen; Tie-Yan Liu

arXiv:2011.01681·stat.ML·November 2, 2021·48 cites

Learning Causal Semantic Representation for Out-of-Distribution Prediction

Chang Liu, Xinwei Sun, Jindong Wang, Haoyue Tang, Tao Li, Tao Qin, Wei, Chen, Tie-Yan Liu

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a causal generative model that separates semantic and variation factors, enabling better out-of-distribution prediction and generalization from a single training domain.

Contribution

The paper proposes a novel causal semantic generative model with a variational Bayes approach, improving OOD prediction and providing theoretical guarantees for semantic factor identification.

Findings

01

Enhanced OOD prediction performance over baselines

02

Theoretical proof of semantic factor identification under certain conditions

03

Bounded OOD generalization error and successful adaptation

Abstract

Conventional supervised learning methods, especially deep ones, are found to be sensitive to out-of-distribution (OOD) examples, largely because the learned representation mixes the semantic factor with the variation factor due to their domain-specific correlation, while only the semantic factor causes the output. To address the problem, we propose a Causal Semantic Generative model (CSG) based on a causal reasoning so that the two factors are modeled separately, and develop methods for OOD prediction from a single training domain, which is common and challenging. The methods are based on the causal invariance principle, with a novel design in variational Bayes for both efficient learning and easy prediction. Theoretically, we prove that under certain conditions, CSG can identify the semantic factor by fitting training data, and this semantic-identification guarantees the boundedness of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

changliu00/causal-semantic-generative-model
pytorchOfficial

Videos

Learning Causal Semantic Representation for Out-of-Distribution Prediction· slideslive

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Topic Modeling · Machine Learning and Data Classification