Leveraging Self-Paced Learning for Software Vulnerability Detection
Zeru Cheng, Yanjing Yang, He Zhang, Lanxin Yang, Jinghao Hu, Jinwei Xu, Bohan Liu, Haifeng Shen

TL;DR
This paper introduces SPLVD, a self-paced learning method that improves software vulnerability detection by dynamically selecting training data, leading to higher accuracy and practical effectiveness on benchmark and real-world datasets.
Contribution
The paper proposes SPLVD, a novel self-paced learning approach that enhances vulnerability detection accuracy by prioritizing easier source code during training.
Findings
SPLVD achieves state-of-the-art F1 scores of 89.2%, 68.7%, and 43.5% on benchmark datasets.
SPLVD attains a high precision of 90.9% on real-world projects from OpenHarmony.
The approach outperforms existing methods in vulnerability detection accuracy.
Abstract
Software vulnerabilities are major risks to software systems. Recently, researchers have proposed many deep learning approaches to detect software vulnerabilities. However, their accuracy is limited in practice. One of the main causes is low-quality training data (i.e., source code). To this end, we propose a new approach: SPLVD (Self-Paced Learning for Software Vulnerability Detection). SPLVD dynamically selects source code for model training based on the stage of training, which simulates the human learning process progressing from easy to hard. SPLVD has a data selector that is specifically designed for the vulnerability detection task, which enables it to prioritize the learning of easy source code. Before each training epoch, SPLVD uses the data selector to recalculate the difficulty of the source code, select new training source code, and update the data selector. When evaluating…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Information and Cyber Security · Software Testing and Debugging Techniques
