Illuminati: Towards Explaining Graph Neural Networks for Cybersecurity Analysis
Haoyu He, Yuede Ji, H. Howie Huang

TL;DR
This paper introduces Illuminati, a comprehensive explanation framework for GNNs in cybersecurity, capable of identifying key graph components contributing to predictions without prior GNN knowledge, improving interpretability and accuracy.
Contribution
Illuminati is a novel, domain-agnostic explanation framework that enhances interpretability of GNNs in cybersecurity applications, outperforming existing methods in accuracy and clarity.
Findings
Achieves 87.6% subgraph retention of original prediction
Outperforms state-of-the-art explanation methods by 10.3%
Provides explanations easily understood by domain experts
Abstract
Graph neural networks (GNNs) have been utilized to create multi-layer graph models for a number of cybersecurity applications from fraud detection to software vulnerability analysis. Unfortunately, like traditional neural networks, GNNs also suffer from a lack of transparency, that is, it is challenging to interpret the model predictions. Prior works focused on specific factor explanations for a GNN model. In this work, we have designed and implemented Illuminati, a comprehensive and accurate explanation framework for cybersecurity applications using GNN models. Given a graph and a pre-trained GNN model, Illuminati is able to identify the important nodes, edges, and attributes that are contributing to the prediction while requiring no prior knowledge of GNN models. We evaluate Illuminati in two cybersecurity applications, i.e., code vulnerability detection and smart contract…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software System Performance and Reliability · Adversarial Robustness in Machine Learning
