Graph Out-of-Distribution Generalization with Controllable Data   Augmentation

Bin Lu; Xiaoying Gan; Ze Zhao; Shiyu Liang; Luoyi Fu; Xinbing Wang,; Chenghu Zhou

arXiv:2308.08344·cs.LG·August 17, 2023

Graph Out-of-Distribution Generalization with Controllable Data Augmentation

Bin Lu, Xiaoying Gan, Ze Zhao, Shiyu Liang, Luoyi Fu, Xinbing Wang,, Chenghu Zhou

PDF

Open Access

TL;DR

This paper introduces OOD-GMixup, a controllable data augmentation method for GNNs that improves out-of-distribution generalization by generating virtual samples and calibrating distribution deviation.

Contribution

It proposes a novel approach combining graph rationale extraction, virtual sample generation, and distribution calibration to enhance OOD robustness in graph classification.

Findings

01

Outperforms state-of-the-art baselines on real-world datasets.

02

Effectively mitigates hybrid structure distribution shift.

03

Improves stability across different graph datasets.

Abstract

Graph Neural Network (GNN) has demonstrated extraordinary performance in classifying graph properties. However, due to the selection bias of training and testing data (e.g., training on small graphs and testing on large graphs, or training on dense graphs and testing on sparse graphs), distribution deviation is widespread. More importantly, we often observe \emph{hybrid structure distribution shift} of both scale and density, despite of one-sided biased data partition. The spurious correlations over hybrid distribution deviation degrade the performance of previous GNN methods and show large instability among different datasets. To alleviate this problem, we propose \texttt{OOD-GMixup} to jointly manipulate the training distribution with \emph{controllable data augmentation} in metric space. Specifically, we first extract the graph rationales to eliminate the spurious correlations due to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Graph Neural Networks · Complex Network Analysis Techniques · Bayesian Modeling and Causal Inference