LLM4CVE: Enabling Iterative Automated Vulnerability Repair with Large   Language Models

Mohamad Fakih; Rahul Dharmaji; Halima Bouzidi; Gustavo Quiros Araya,; Oluwatosin Ogundare; Mohammad Abdullah Al Faruque

arXiv:2501.03446·cs.SE·January 8, 2025

LLM4CVE: Enabling Iterative Automated Vulnerability Repair with Large Language Models

Mohamad Fakih, Rahul Dharmaji, Halima Bouzidi, Gustavo Quiros Araya,, Oluwatosin Ogundare, Mohammad Abdullah Al Faruque

PDF

Open Access

TL;DR

This paper introduces LLM4CVE, an iterative pipeline utilizing large language models to automatically and effectively repair software vulnerabilities in real-world code, demonstrating high accuracy and improved code similarity.

Contribution

It presents a novel LLM-based iterative approach for vulnerability repair, including implementation details, evaluation with multiple LLMs, and publicly available resources for future research.

Findings

01

Achieved a human-verified quality score of 8.51/10

02

Increased groundtruth code similarity by 20% with Llama 3 70B

03

Demonstrated effectiveness across multiple state-of-the-art LLMs

Abstract

Software vulnerabilities continue to be ubiquitous, even in the era of AI-powered code assistants, advanced static analysis tools, and the adoption of extensive testing frameworks. It has become apparent that we must not simply prevent these bugs, but also eliminate them in a quick, efficient manner. Yet, human code intervention is slow, costly, and can often lead to further security vulnerabilities, especially in legacy codebases. The advent of highly advanced Large Language Models (LLM) has opened up the possibility for many software defects to be patched automatically. We propose LLM4CVE an LLM-based iterative pipeline that robustly fixes vulnerable functions in real-world code with high accuracy. We examine our pipeline with State-of-the-Art LLMs, such as GPT-3.5, GPT-4o, Llama 38B, and Llama 3 70B. We achieve a human-verified quality score of 8.51/10 and an increase in groundtruth…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsWeb Application Security Vulnerabilities · Software System Performance and Reliability · Network Security and Intrusion Detection