Graph Out-of-Distribution Generalization with Controllable Data Augmentation
Bin Lu, Xiaoying Gan, Ze Zhao, Shiyu Liang, Luoyi Fu, Xinbing Wang,, Chenghu Zhou

TL;DR
This paper introduces OOD-GMixup, a controllable data augmentation method for GNNs that improves out-of-distribution generalization by generating virtual samples and calibrating distribution deviation.
Contribution
It proposes a novel approach combining graph rationale extraction, virtual sample generation, and distribution calibration to enhance OOD robustness in graph classification.
Findings
Outperforms state-of-the-art baselines on real-world datasets.
Effectively mitigates hybrid structure distribution shift.
Improves stability across different graph datasets.
Abstract
Graph Neural Network (GNN) has demonstrated extraordinary performance in classifying graph properties. However, due to the selection bias of training and testing data (e.g., training on small graphs and testing on large graphs, or training on dense graphs and testing on sparse graphs), distribution deviation is widespread. More importantly, we often observe \emph{hybrid structure distribution shift} of both scale and density, despite of one-sided biased data partition. The spurious correlations over hybrid distribution deviation degrade the performance of previous GNN methods and show large instability among different datasets. To alleviate this problem, we propose \texttt{OOD-GMixup} to jointly manipulate the training distribution with \emph{controllable data augmentation} in metric space. Specifically, we first extract the graph rationales to eliminate the spurious correlations due to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Complex Network Analysis Techniques · Bayesian Modeling and Causal Inference
