Graph Inverse Style Transfer for Counterfactual Explainability

Bardh Prenkaj; Efstratios Zaradoukas; Gjergji Kasneci

arXiv:2505.17542·cs.LG·July 8, 2025

Graph Inverse Style Transfer for Counterfactual Explainability

Bardh Prenkaj, Efstratios Zaradoukas, Gjergji Kasneci

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces Graph Inverse Style Transfer (GIST), a novel framework for graph counterfactual explainability that uses spectral style transfer and backtracking to generate valid and faithful counterfactuals, outperforming existing methods.

Contribution

GIST is the first framework to reframe graph counterfactual generation as a backtracking process using spectral style transfer, improving validity and faithfulness of explanations.

Findings

01

+7.6% improvement in counterfactual validity

02

+45.5% better explanation fidelity

03

Effective mitigation of decision boundary overshooting

Abstract

Counterfactual explainability seeks to uncover model decisions by identifying minimal changes to the input that alter the predicted outcome. This task becomes particularly challenging for graph data due to preserving structural integrity and semantic meaning. Unlike prior approaches that rely on forward perturbation mechanisms, we introduce Graph Inverse Style Transfer (GIST), the first framework to re-imagine graph counterfactual generation as a backtracking process, leveraging spectral style transfer. By aligning the global structure with the original input spectrum and preserving local content faithfulness, GIST produces valid counterfactuals as interpolations between the input style and counterfactual content. Tested on 8 binary and multi-class graph classification benchmarks, GIST achieves a remarkable +7.6% improvement in the validity of produced counterfactuals and significant…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

bardhprenkaj/gist
pytorchOfficial

Videos

Graph Inverse Style Transfer for Counterfactual Explainability· slideslive

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning

MethodsCounterfactuals Explanations