Graph Counterfactual Explainable AI via Latent Space Traversal
Andreas Abildtrup Hansen, Paraskevas Pegios, Anna Calissano, Aasa, Feragen

TL;DR
This paper introduces a novel method for generating counterfactual explanations for graph neural networks by traversing a latent space in a permutation-equivariant autoencoder, enabling continuous and robust explanations.
Contribution
It proposes a permutation-equivariant graph variational autoencoder to produce continuous counterfactual explanations for any differentiable graph classifier.
Findings
High-performing counterfactual explanations on three datasets
More robust than baseline methods
Seamless integration of discrete and continuous graph features
Abstract
Explaining the predictions of a deep neural network is a nontrivial task, yet high-quality explanations for predictions are often a prerequisite for practitioners to trust these models. Counterfactual explanations aim to explain predictions by finding the ''nearest'' in-distribution alternative input whose prediction changes in a pre-specified way. However, it remains an open question how to define this nearest alternative input, whose solution depends on both the domain (e.g. images, graphs, tabular data, etc.) and the specific application considered. For graphs, this problem is complicated i) by their discrete nature, as opposed to the continuous nature of state-of-the-art graph classifiers; and ii) by the node permutation group acting on the graphs. We propose a method to generate counterfactual explanations for any differentiable black-box graph classifier, utilizing a case-specific…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
