TL;DR
This paper introduces a causal attention method for graph neural networks to improve interpretability and generalization by distinguishing causal features from confounding shortcuts, addressing issues of spurious correlations and distribution shifts.
Contribution
It proposes the Causal Attention Learning (CAL) strategy that estimates causal and shortcut features and applies backdoor adjustment to enhance GNN robustness and interpretability.
Findings
CAL improves out-of-distribution generalization.
CAL enhances interpretability of graph classifiers.
Experimental results show superior performance over baselines.
Abstract
In graph classification, attention and pooling-based graph neural networks (GNNs) prevail to extract the critical features from the input graph and support the prediction. They mostly follow the paradigm of learning to attend, which maximizes the mutual information between the attended graph and the ground-truth label. However, this paradigm makes GNN classifiers recklessly absorb all the statistical correlations between input features and labels in the training data, without distinguishing the causal and noncausal effects of features. Instead of underscoring the causal features, the attended graphs are prone to visit the noncausal features as the shortcut to predictions. Such shortcut features might easily change outside the training distribution, thereby making the GNN classifiers suffer from poor generalization. In this work, we take a causal look at the GNN modeling for graph…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
