Fixing Security Vulnerabilities with AI in OSS-Fuzz

Yuntong Zhang; Jiawei Wang; Dominic Berzin; Martin Mirchev; and Dongge Liu; Abhishek Arya; Oliver Chang; Abhik Roychoudhury

arXiv:2411.03346·cs.CR·November 22, 2024

Fixing Security Vulnerabilities with AI in OSS-Fuzz

Yuntong Zhang, Jiawei Wang, Dominic Berzin, Martin Mirchev, and Dongge Liu, Abhishek Arya, Oliver Chang, Abhik Roychoudhury

PDF

Open Access

TL;DR

This paper explores using AI, specifically LLM agents like AutoCodeRover, to automate security vulnerability fixing in open source software validated by OSS-Fuzz, emphasizing dynamic testing over static code similarity for patch correctness.

Contribution

It demonstrates the adaptation of LLM agents for security patching using exploit test executions, highlighting the limitations of code similarity metrics in ensuring patch correctness.

Findings

01

LLM agents can effectively automate security vulnerability patches.

02

Code similarity metrics like CodeBLEU are insufficient for patch quality assessment.

03

Dynamic test execution is crucial for verifying security patch correctness.

Abstract

Critical open source software systems undergo significant validation in the form of lengthy fuzz campaigns. The fuzz campaigns typically conduct a biased random search over the domain of program inputs, to find inputs which crash the software system. Such fuzzing is useful to enhance the security of software systems in general since even closed source software may use open source components. Hence testing open source software is of paramount importance. Currently OSS-Fuzz is the most significant and widely used infrastructure for continuous validation of open source systems. Unfortunately even though OSS-Fuzz has identified more than 10,000 vulnerabilities across 1000 or more software projects, the detected vulnerabilities may remain unpatched, as vulnerability fixing is often manual in practice. In this work, we rely on the recent progress in Large Language Model (LLM) agents for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInformation and Cyber Security · Advanced Malware Detection Techniques · Network Security and Intrusion Detection