VFArch\=e: A Dual-Mode Framework for Locating Vulnerable Functions in Open-Source Software
Lyuye Zhang, Jian Zhang, Kaixuan Li, Chong Wang, Chengwei Liu, Jiahui Wu, Sen Chen, Yaowen Zheng, Yang Liu

TL;DR
VFArch extasciitilde e is a dual-mode framework that effectively locates vulnerable functions in open-source software, handling cases with or without available patches, and improves vulnerability detection accuracy.
Contribution
The paper introduces VFArch extasciitilde e, a novel dual-mode approach that automatically localizes vulnerable functions in OSS regardless of patch availability, outperforming existing methods.
Findings
Achieves 1.3x and 1.9x higher MRR over baselines in patch-present and patch-absent modes.
Successfully locates vulnerable functions for 43 out of 50 recent vulnerabilities.
Reduces false positives of SCA tools by 78-89%.
Abstract
Software Composition Analysis (SCA) has become pivotal in addressing vulnerabilities inherent in software project dependencies. In particular, reachability analysis is increasingly used in Open-Source Software (OSS) projects to identify reachable vulnerabilities (e.g., CVEs) through call graphs, enabling a focus on exploitable risks. Performing reachability analysis typically requires the vulnerable function (VF) to track the call chains from downstream applications. However, such crucial information is usually unavailable in modern vulnerability databases like NVD. While directly extracting VF from modified functions in vulnerability patches is intuitive, patches are not always available. Moreover, our preliminary study shows that over 26% of VF do not exist in the modified functions. Meanwhile, simply ignoring patches to search vulnerable functions suffers from overwhelming noises and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
