Graph Inverse Style Transfer for Counterfactual Explainability
Bardh Prenkaj, Efstratios Zaradoukas, Gjergji Kasneci

TL;DR
This paper introduces Graph Inverse Style Transfer (GIST), a novel framework for graph counterfactual explainability that uses spectral style transfer and backtracking to generate valid and faithful counterfactuals, outperforming existing methods.
Contribution
GIST is the first framework to reframe graph counterfactual generation as a backtracking process using spectral style transfer, improving validity and faithfulness of explanations.
Findings
+7.6% improvement in counterfactual validity
+45.5% better explanation fidelity
Effective mitigation of decision boundary overshooting
Abstract
Counterfactual explainability seeks to uncover model decisions by identifying minimal changes to the input that alter the predicted outcome. This task becomes particularly challenging for graph data due to preserving structural integrity and semantic meaning. Unlike prior approaches that rely on forward perturbation mechanisms, we introduce Graph Inverse Style Transfer (GIST), the first framework to re-imagine graph counterfactual generation as a backtracking process, leveraging spectral style transfer. By aligning the global structure with the original input spectrum and preserving local content faithfulness, GIST produces valid counterfactuals as interpolations between the input style and counterfactual content. Tested on 8 binary and multi-class graph classification benchmarks, GIST achieves a remarkable +7.6% improvement in the validity of produced counterfactuals and significant…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning
MethodsCounterfactuals Explanations
