Learning Causally Invariant Representations for Out-of-Distribution Generalization on Graphs
Yongqiang Chen, Yonggang Zhang, Yatao Bian, Han Yang, Kaili Ma,, Binghui Xie, Tongliang Liu, Bo Han, James Cheng

TL;DR
This paper introduces CIGA, a framework for learning causally invariant subgraph representations to improve out-of-distribution generalization on graph data, addressing challenges posed by diverse distribution shifts.
Contribution
CIGA leverages causal models and an information-theoretic objective to identify invariant subgraphs, enhancing OOD robustness without requiring environment labels.
Findings
CIGA outperforms baselines on 16 datasets.
It achieves superior OOD generalization in drug discovery tasks.
The method effectively captures invariant causal subgraphs.
Abstract
Despite recent success in using the invariance principle for out-of-distribution (OOD) generalization on Euclidean data (e.g., images), studies on graph data are still limited. Different from images, the complex nature of graphs poses unique challenges to adopting the invariance principle. In particular, distribution shifts on graphs can appear in a variety of forms such as attributes and structures, making it difficult to identify the invariance. Moreover, domain or environment partitions, which are often required by OOD methods on Euclidean data, could be highly expensive to obtain for graphs. To bridge this gap, we propose a new framework, called Causality Inspired Invariant Graph LeArning (CIGA), to capture the invariance of graphs for guaranteed OOD generalization under various distribution shifts. Specifically, we characterize potential distribution shifts on graphs with causal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsHealth, Environment, Cognitive Aging · Machine Learning in Healthcare · Data-Driven Disease Surveillance
