Enhancing Code Vulnerability Detection via Vulnerability-Preserving Data Augmentation
Shangqing Liu, Wei Ma, Jian Wang, Xiaofei Xie, Ruitao Feng, Yang, Liu

TL;DR
This paper introduces FGVulDet, a fine-grained vulnerability detection model that uses multiple classifiers and a novel data augmentation technique to improve vulnerability type identification and generalization on large-scale datasets.
Contribution
The paper proposes a multi-classifier framework with vulnerability-preserving data augmentation and an edge-aware GGNN to enhance code vulnerability detection accuracy and robustness.
Findings
FGVulDet outperforms static-analysis and existing learning-based methods.
The data augmentation improves detection for scarce vulnerability types.
Edge-aware GGNN captures program semantics more effectively.
Abstract
Source code vulnerability detection aims to identify inherent vulnerabilities to safeguard software systems from potential attacks. Many prior studies overlook diverse vulnerability characteristics, simplifying the problem into a binary (0-1) classification task for example determining whether it is vulnerable or not. This poses a challenge for a single deep learning-based model to effectively learn the wide array of vulnerability characteristics. Furthermore, due to the challenges associated with collecting large-scale vulnerability data, these detectors often overfit limited training datasets, resulting in lower model generalization performance. To address the aforementioned challenges, in this work, we introduce a fine-grained vulnerability detector namely FGVulDet. Unlike previous approaches, FGVulDet employs multiple classifiers to discern characteristics of various vulnerability…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Advanced Malware Detection Techniques · Web Application Security Vulnerabilities
