CF-GNNExplainer: Counterfactual Explanations for Graph Neural Networks
Ana Lucic, Maartje ter Hoeve, Gabriele Tolomei, Maarten de Rijke,, Fabrizio Silvestri

TL;DR
CF-GNNExplainer introduces a novel approach for generating counterfactual explanations for GNNs by minimally perturbing input graphs through edge deletions to understand prediction changes.
Contribution
The paper presents the first method for counterfactual explanations in GNNs, focusing on minimal edge deletions to alter predictions.
Findings
Achieves over 94% accuracy in generating CF explanations.
Removes less than 3 edges on average per explanation.
Effectively identifies crucial edges influencing GNN predictions.
Abstract
Given the increasing promise of graph neural networks (GNNs) in real-world applications, several methods have been developed for explaining their predictions. Existing methods for interpreting predictions from GNNs have primarily focused on generating subgraphs that are especially relevant for a particular prediction. However, such methods are not counterfactual (CF) in nature: given a prediction, we want to understand how the prediction can be changed in order to achieve an alternative outcome. In this work, we propose a method for generating CF explanations for GNNs: the minimal perturbation to the input (graph) data such that the prediction changes. Using only edge deletions, we find that our method, CF-GNNExplainer, can generate CF explanations for the majority of instances across three widely used datasets for GNN explanations, while removing less than 3 edges on average, with at…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Advanced Graph Neural Networks · Adversarial Robustness in Machine Learning
