Detecting Code Vulnerabilities with Heterogeneous GNN Training
Yu Luo, Weifeng Xu, Dianxiang Xu

TL;DR
This paper introduces a novel heterogeneous GNN-based approach using Inter-Procedural Abstract Graphs for accurate, language-agnostic vulnerability detection in source code, outperforming existing methods.
Contribution
It proposes IPAGs and Heterogeneous Attention GNNs to better model code relationships and improve vulnerability prediction accuracy.
Findings
Achieved up to 96.6% accuracy on C datasets.
Achieved up to 97.8% accuracy on Java datasets.
Outperformed state-of-the-art vulnerability detection methods.
Abstract
Detecting vulnerabilities in source code is a critical task for software security assurance. Graph Neural Network (GNN) machine learning can be a promising approach by modeling source code as graphs. Early approaches treated code elements uniformly, limiting their capacity to model diverse relationships that contribute to various vulnerabilities. Recent research addresses this limitation by considering the heterogeneity of node types and using Gated Graph Neural Networks (GGNN) to aggregate node information through different edge types. However, these edges primarily function as conduits for passing node information and may not capture detailed characteristics of distinct edge types. This paper presents Inter-Procedural Abstract Graphs (IPAGs) as an efficient, language-agnostic representation of source code, complemented by heterogeneous GNN training for vulnerability prediction. IPAGs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWeb Application Security Vulnerabilities · Software Reliability and Analysis Research · Software Engineering Research
