A Research and Development Portfolio of GNN Centric Malware Detection, Explainability, and Dataset Curation
Hossein Shokouhinejad, Griffin Higgins, Roozbeh Razavi-Far, Ali A. Ghorbani

TL;DR
This paper presents a comprehensive research portfolio on GNN-based malware detection, focusing on scalability, interpretability, dataset curation, and introducing new methods and explanations to advance the field.
Contribution
It combines surveys, novel graph reduction techniques, explanation methods, ensemble models, and curated datasets to improve GNN malware detection and interpretability.
Findings
Enhanced explainability with dual subgraph matching techniques.
Improved model interpretability using attention-guided GNN ensembles.
Curated datasets facilitate reproducibility and future research.
Abstract
Graph Neural Networks (GNNs) have become an effective tool for malware detection by capturing program execution through graph-structured representations. However, important challenges remain regarding scalability, interpretability, and the availability of reliable datasets. This paper brings together six related studies that collectively address these issues. The portfolio begins with a survey of graph-based malware detection and explainability, then advances to new graph reduction methods, integrated reduction-learning approaches, and investigations into the consistency of explanations. It also introduces dual explanation techniques based on subgraph matching and develops ensemble-based models with attention-guided stacked GNNs to improve interpretability. In parallel, curated datasets of control flow graphs are released to support reproducibility and enable future research. Together,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)
