Multi-Granularity Detector for Vulnerability Fixes
Truong Giang Nguyen, Thanh Le-Cong, Hong Jin Kang, Ratnadira, Widyasari, Chengran Yang, Zhipeng Zhao, Bowen Xu, Jiayuan Zhou, Xin Xia,, Ahmed E. Hassan, Xuan-Bach D. Le, David Lo

TL;DR
MiDas is a multi-granularity neural network ensemble that improves detection of vulnerability-fixing commits in open source software, outperforming existing methods by handling noisy data and imbalanced classes effectively.
Contribution
Introduces MiDas, a novel multi-granularity neural ensemble model that leverages different code change levels to enhance vulnerability fix detection accuracy.
Findings
Outperforms state-of-the-art baseline in AUC by 4.9% (Java) and 13.7% (Python)
Achieves up to 28.2% and 15.9% improvements in EffortCost@L and Popt@L (Java)
Achieves up to 60% and 51.4% improvements in EffortCost@L and Popt@L (Python)
Abstract
With the increasing reliance on Open Source Software, users are exposed to third-party library vulnerabilities. Software Composition Analysis (SCA) tools have been created to alert users of such vulnerabilities. SCA requires the identification of vulnerability-fixing commits. Prior works have proposed methods that can automatically identify such vulnerability-fixing commits. However, identifying such commits is highly challenging, as only a very small minority of commits are vulnerability fixing. Moreover, code changes can be noisy and difficult to analyze. We observe that noise can occur at different levels of detail, making it challenging to detect vulnerability fixes accurately. To address these challenges and boost the effectiveness of prior works, we propose MiDas (Multi-Granularity Detector for Vulnerability Fixes). Unique from prior works, Midas constructs different neural…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software Reliability and Analysis Research · Web Application Security Vulnerabilities
MethodsLib · Semantic Cross Attention · Balanced Selection
