Counterfactual Graphs for Explainable Classification of Brain Networks
Carlo Abrate, Francesco Bonchi

TL;DR
This paper introduces counterfactual graphs as a novel method for explaining black-box graph classifiers in brain network analysis, aiding neuroscientists in understanding model decisions and uncovering brain structure insights.
Contribution
It proposes a new approach for generating counterfactual explanations for graph classifiers, with empirical methods to produce near-optimal counterfactuals and tools for global model interpretation.
Findings
Heuristic methods produce counterfactuals close to optimal.
Counterfactual explanations help interpret black-box classifiers.
Global explanations provide insights into model behavior.
Abstract
Training graph classifiers able to distinguish between healthy brains and dysfunctional ones, can help identifying substructures associated to specific cognitive phenotypes. However, the mere predictive power of the graph classifier is of limited interest to the neuroscientists, which have plenty of tools for the diagnosis of specific mental disorders. What matters is the interpretation of the model, as it can provide novel insights and new hypotheses. In this paper we propose \emph{counterfactual graphs} as a way to produce local post-hoc explanations of any black-box graph classifier. Given a graph and a black-box, a counterfactual is a graph which, while having high structural similarity with the original graph, is classified by the black-box in a different class. We propose and empirically compare several strategies for counterfactual graph search. Our experiments against a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Functional Brain Connectivity Studies · Machine Learning in Healthcare
MethodsCounterfactuals Explanations
