Quantifying Explanation Quality in Graph Neural Networks using Out-of-Distribution Generalization
Ding Zhang, Siddharth Betala, Chirag Agarwal

TL;DR
This paper introduces the Explanation-Generalization Score (EGS), a new metric for evaluating GNN explanations based on their causal relevance and stability under distribution shifts, validated across diverse datasets.
Contribution
We propose EGS, a novel evaluation metric for GNN explanations that assesses causal relevance through out-of-distribution generalization, addressing limitations of existing metrics.
Findings
EGS effectively ranks explainers by causal relevance.
EGS correlates with true causal substructure detection.
Large-scale validation confirms EGS's robustness and effectiveness.
Abstract
Evaluating the quality of post-hoc explanations for Graph Neural Networks (GNNs) remains a significant challenge. While recent years have seen an increasing development of explainability methods, current evaluation metrics (e.g., fidelity, sparsity) often fail to assess whether an explanation identifies the true underlying causal variables. To address this, we propose the Explanation-Generalization Score (EGS), a metric that quantifies the causal relevance of GNN explanations. EGS is founded on the principle of feature invariance and posits that if an explanation captures true causal drivers, it should lead to stable predictions across distribution shifts. To quantify this, we introduce a framework that trains GNNs using explanatory subgraphs and evaluates their performance in Out-of-Distribution (OOD) settings (here, OOD generalization serves as a rigorous proxy for the explanation's…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Advanced Graph Neural Networks · Adversarial Robustness in Machine Learning
