Multi-Granularity Detector for Vulnerability Fixes

Truong Giang Nguyen; Thanh Le-Cong; Hong Jin Kang; Ratnadira; Widyasari; Chengran Yang; Zhipeng Zhao; Bowen Xu; Jiayuan Zhou; Xin Xia,; Ahmed E. Hassan; Xuan-Bach D. Le; David Lo

arXiv:2305.13884·cs.CR·May 24, 2023·2 cites

Multi-Granularity Detector for Vulnerability Fixes

Truong Giang Nguyen, Thanh Le-Cong, Hong Jin Kang, Ratnadira, Widyasari, Chengran Yang, Zhipeng Zhao, Bowen Xu, Jiayuan Zhou, Xin Xia,, Ahmed E. Hassan, Xuan-Bach D. Le, David Lo

PDF

Open Access 2 Repos

TL;DR

MiDas is a multi-granularity neural network ensemble that improves detection of vulnerability-fixing commits in open source software, outperforming existing methods by handling noisy data and imbalanced classes effectively.

Contribution

Introduces MiDas, a novel multi-granularity neural ensemble model that leverages different code change levels to enhance vulnerability fix detection accuracy.

Findings

01

Outperforms state-of-the-art baseline in AUC by 4.9% (Java) and 13.7% (Python)

02

Achieves up to 28.2% and 15.9% improvements in EffortCost@L and Popt@L (Java)

03

Achieves up to 60% and 51.4% improvements in EffortCost@L and Popt@L (Python)

Abstract

With the increasing reliance on Open Source Software, users are exposed to third-party library vulnerabilities. Software Composition Analysis (SCA) tools have been created to alert users of such vulnerabilities. SCA requires the identification of vulnerability-fixing commits. Prior works have proposed methods that can automatically identify such vulnerability-fixing commits. However, identifying such commits is highly challenging, as only a very small minority of commits are vulnerability fixing. Moreover, code changes can be noisy and difficult to analyze. We observe that noise can occur at different levels of detail, making it challenging to detect vulnerability fixes accurately. To address these challenges and boost the effectiveness of prior works, we propose MiDas (Multi-Granularity Detector for Vulnerability Fixes). Unique from prior works, Midas constructs different neural…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Engineering Research · Software Reliability and Analysis Research · Web Application Security Vulnerabilities

MethodsLib · Semantic Cross Attention · Balanced Selection