Global Concept-Based Interpretability for Graph Neural Networks via Neuron Analysis
Han Xuanyuan, Pietro Barbiero, Dobrik Georgiev, Lucie Charlotte, Magister, Pietro Li\'o

TL;DR
This paper introduces a novel global interpretability method for GNNs by analyzing individual neuron behaviors, revealing that neurons act as concept detectors aligned with logical graph properties, and offers a more transparent explanation framework.
Contribution
It is the first to demonstrate that GNN neurons function as concept detectors and to develop a global explanation approach based on neuron-level concepts.
Findings
GNN neurons act as concept detectors aligned with logical graph properties
A trade-off exists between training duration and neuron interpretability
The proposed global explanation method improves interpretability and reduces bias
Abstract
Graph neural networks (GNNs) are highly effective on a variety of graph-related tasks; however, they lack interpretability and transparency. Current explainability approaches are typically local and treat GNNs as black-boxes. They do not look inside the model, inhibiting human trust in the model and explanations. Motivated by the ability of neurons to detect high-level semantic concepts in vision models, we perform a novel analysis on the behaviour of individual GNN neurons to answer questions about GNN interpretability, and propose new metrics for evaluating the interpretability of GNN neurons. We propose a novel approach for producing global explanations for GNNs using neuron-level concepts to enable practitioners to have a high-level view of the model. Specifically, (i) to the best of our knowledge, this is the first work which shows that GNN neurons act as concept detectors and have…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Machine Learning in Materials Science
