Counterfactual Graphs for Explainable Classification of Brain Networks

Carlo Abrate; Francesco Bonchi

arXiv:2106.08640·cs.SI·June 21, 2021

TL;DR

This paper introduces counterfactual graphs as a novel method for explaining black-box graph classifiers in brain network analysis, aiding neuroscientists in understanding model decisions and uncovering brain structure insights.

Contribution

It proposes a new approach for generating counterfactual explanations for graph classifiers, with empirical methods to produce near-optimal counterfactuals and tools for global model interpretation.

Findings

01 Heuristic methods produce counterfactuals close to optimal.

02 Counterfactual explanations help interpret black-box classifiers.

03 Global explanations provide insights into model behavior.

Abstract

Training graph classifiers that can distinguish healthy brains from dysfunctional ones can help identify substructures associated with specific cognitive phenotypes. However, the mere predictive power of the graph classifier is of limited interest to neuroscientists, who have plenty of tools for the diagnosis of specific mental disorders. What matters is the interpretation of the model, as it can provide novel insights and new hypotheses. In this paper we propose counterfactual graphs as a way to produce local post-hoc explanations of any black-box graph classifier. Given a graph and a black-box, a counterfactual is a graph that is structurally very similar to the original graph but is assigned to a different class by the black-box. We propose and empirically compare several strategies for counterfactual graph search. Our experiments against a…
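To make the definition in the abstract concrete, here is a minimal Python sketch of one possible search strategy: among a pool of candidate graphs, pick the one closest to the input graph that the black-box places in a different class. It is an illustration under stated assumptions, not the authors' implementation. It assumes all graphs are undirected, share the same node set, and are given as 0/1 adjacency matrices, and it measures similarity as the number of node pairs whose edge status differs. The names `edit_distance`, `toy_black_box`, `nearest_counterfactual`, and `from_edges` are hypothetical, and the classifier is a toy stand-in for a real model.

```python
import numpy as np


def from_edges(n, edges):
    """Build a symmetric 0/1 adjacency matrix on n nodes from an edge list."""
    adj = np.zeros((n, n), dtype=int)
    for i, j in edges:
        adj[i, j] = adj[j, i] = 1
    return adj


def edit_distance(a, b):
    """Count node pairs whose edge status differs (assumed similarity measure)."""
    # Only the strict upper triangle is counted, since the graphs are undirected.
    return int(np.triu(a != b, k=1).sum())


def toy_black_box(adj):
    """Hypothetical stand-in classifier: class 1 if the graph has more than 3 edges."""
    return int(np.triu(adj, k=1).sum() > 3)


def nearest_counterfactual(graph, candidates, classify):
    """Return the closest candidate with a different predicted class, and its distance.

    Returns (None, None) if no candidate changes the prediction.
    """
    original_label = classify(graph)
    best, best_dist = None, None
    for cand in candidates:
        if classify(cand) == original_label:
            continue  # same class, so not a counterfactual
        d = edit_distance(graph, cand)
        if best_dist is None or d < best_dist:
            best, best_dist = cand, d
    return best, best_dist


# Toy usage: a 4-node path graph (3 edges, class 0) and three candidates.
g = from_edges(4, [(0, 1), (1, 2), (2, 3)])
candidates = [
    from_edges(4, [(0, 1), (1, 2)]),                                   # class 0
    from_edges(4, [(0, 1), (1, 2), (2, 3), (0, 3)]),                   # class 1
    from_edges(4, [(0, 1), (0, 2), (0, 3), (1, 2), (1, 3), (2, 3)]),   # class 1
]
cf, dist = nearest_counterfactual(g, candidates, toy_black_box)
```

In this toy run the selected counterfactual differs from `g` by a single added edge, so the edges in which the two graphs differ serve as the local explanation. The search only queries the classifier through `classify`, which is what makes the approach applicable to a black-box.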

Code & Models

Repositories

carlo-abrate/CounterfactualGraphs (Official)

Taxonomy

Topics: Explainable Artificial Intelligence (XAI) · Functional Brain Connectivity Studies · Machine Learning in Healthcare

Methods: Counterfactual Explanations