Beyond Edge Deletion: A Comprehensive Approach to Counterfactual Explanation in Graph Neural Networks
Matteo De Sanctis, Riccardo De Sanctis, Stefano Faralli, Paola Velardi, Bardh Prenkaj

TL;DR
This paper introduces XPlore, a gradient-guided method that expands the counterfactual explanation search space for GNNs by including edge insertions and node-feature perturbations, improving interpretability in high-stakes applications.
Contribution
XPlore is a novel technique that broadens counterfactual search in GNNs by jointly optimizing edge insertions and node features within a gradient-based framework, surpassing prior edge deletion methods.
Findings
Up to 56.3% improvement in validity over baselines
Up to 52.8% improvement in fidelity over baselines
Produces more coherent and minimal counterfactuals
Abstract
Graph Neural Networks (GNNs) are increasingly adopted across domains such as molecular biology and social network analysis, yet their black-box nature hinders interpretability and trust. This is especially problematic in high-stakes applications, such as predicting molecule toxicity, drug discovery, or guiding financial fraud detections, where transparent explanations are essential. Counterfactual explanations - minimal changes that flip a model's prediction - offer a transparent lens into GNNs' behavior. In this work, we introduce XPlore, a novel technique that significantly broadens the counterfactual search space. It consists of gradient-guided perturbations to adjacency and node feature matrices. Unlike most prior methods, which focus solely on edge deletions, our approach belongs to the growing class of techniques that optimize edge insertions and node-feature perturbations, here…
Peer Reviews
Decision·Submitted to ICLR 2026
- XPlore covers graph modifications that most GCE methods don't. Insertions + feature shifts matter. - Performance on benchmarks against baselines is very strong. - The authors are honest about their OOD performance and highlight key challenges for all methods.
- The method still relies on an oracle model, which adds another degree of freedom for practitioners to consider and means XPlore also likely inherits the oracle's faults. - The OOD discussion seems to suggest that XPlore focuses on model-flipping counterfactuals rather than plausible counterfactuals, which is a significant weakness.
- Clear algorithmic description with well-structured steps; the method is easy to follow. - Extensive experiments with comparisons against multiple competing approaches. - Thoughtful discussion of future directions that can guide subsequent research.
- The contributions are repetitive and could be more concise. For example, contribution points 1 and 4 appear overlapping and could be merged. - Positioning the work as an “extension” of a prior paper weakens the novelty message. - Novelty is limited in parts; for instance, edge addition in counterfactual explanations has prior art. - The motivation for feature perturbations is underdeveloped, and the ablation on this component is limited. - The search space and resulting computational complexit
1. The proposed method XPlore achieves impressive improvements on both validity and fidelity metrics. 2. The method’s performance was validated on 14 datasets, spanning multiple graph types. 3. This paper explicitly acknowledged that the residual OOD effects remain open and links them to robustness of oracles.
1. To my current knowledge, there exists prior work (e.g., C2Explainer) that has already enabled edge insertion and node feature perturbations; this might weaken Xplore’s claimed novelty unless positioned more precisely. 2. While the evaluation was performed on 14 datasets, it is skewed towards molecular/biology category, with only one social network dataset (i.e., COLLAB). As social network analysis might be a key application area for GNN interpretability, adding more datasets in this area woul
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning
