Learning Causally Invariant Representations for Out-of-Distribution   Generalization on Graphs

Yongqiang Chen; Yonggang Zhang; Yatao Bian; Han Yang; Kaili Ma,; Binghui Xie; Tongliang Liu; Bo Han; James Cheng

arXiv:2202.05441·cs.LG·October 12, 2022·49 cites

Learning Causally Invariant Representations for Out-of-Distribution Generalization on Graphs

Yongqiang Chen, Yonggang Zhang, Yatao Bian, Han Yang, Kaili Ma,, Binghui Xie, Tongliang Liu, Bo Han, James Cheng

PDF

Open Access 3 Repos 1 Video

TL;DR

This paper introduces CIGA, a framework for learning causally invariant subgraph representations to improve out-of-distribution generalization on graph data, addressing challenges posed by diverse distribution shifts.

Contribution

CIGA leverages causal models and an information-theoretic objective to identify invariant subgraphs, enhancing OOD robustness without requiring environment labels.

Findings

01

CIGA outperforms baselines on 16 datasets.

02

It achieves superior OOD generalization in drug discovery tasks.

03

The method effectively captures invariant causal subgraphs.

Abstract

Despite recent success in using the invariance principle for out-of-distribution (OOD) generalization on Euclidean data (e.g., images), studies on graph data are still limited. Different from images, the complex nature of graphs poses unique challenges to adopting the invariance principle. In particular, distribution shifts on graphs can appear in a variety of forms such as attributes and structures, making it difficult to identify the invariance. Moreover, domain or environment partitions, which are often required by OOD methods on Euclidean data, could be highly expensive to obtain for graphs. To bridge this gap, we propose a new framework, called Causality Inspired Invariant Graph LeArning (CIGA), to capture the invariance of graphs for guaranteed OOD generalization under various distribution shifts. Specifically, we characterize potential distribution shifts on graphs with causal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Learning Causally Invariant Representations for Out-of-Distribution Generalization on Graphs· slideslive

Taxonomy

TopicsHealth, Environment, Cognitive Aging · Machine Learning in Healthcare · Data-Driven Disease Surveillance