Featherweight Assisted Vulnerability Discovery

David Binkley; Leon Moonen; Sibren Isaacman

arXiv:2202.02679·cs.CR·February 8, 2022

Featherweight Assisted Vulnerability Discovery

David Binkley, Leon Moonen, Sibren Isaacman

PDF

1 Repo

TL;DR

This paper introduces a lightweight, frequency-based method using function name words to predict vulnerable code, providing transparency and requiring less training data than deep neural networks.

Contribution

It presents a novel, transparent frequency-based algorithm for vulnerability prediction that complements deep learning approaches and requires significantly less training data.

Findings

01

Words in function names significantly contribute to vulnerability prediction.

02

The approach is effective especially with homogeneous vocabularies.

03

It performs well with much less training data than DNNs.

Abstract

Predicting vulnerable source code helps to focus attention on those parts of the code that need to be examined with more scrutiny. Recent work proposed the use of function names as semantic cues that can be learned by a deep neural network (DNN) to aid in the hunt for vulnerability of functions. Combining identifier splitting, which splits each function name into its constituent words, with a novel frequency-based algorithm, we explore the extent to which the words that make up a function's name can predict potentially vulnerable functions. In contrast to *lightweight* predictions by a DNN that considers only function names, avoiding the use of a DNN provides *featherweight* predictions. The underlying idea is that function names that contain certain "dangerous" words are more likely to accompany vulnerable functions. Of course, this assumes that the frequency-based algorithm can be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

secureit-project/favd
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.