LLM4CVE: Enabling Iterative Automated Vulnerability Repair with Large Language Models
Mohamad Fakih, Rahul Dharmaji, Halima Bouzidi, Gustavo Quiros Araya,, Oluwatosin Ogundare, Mohammad Abdullah Al Faruque

TL;DR
This paper introduces LLM4CVE, an iterative pipeline utilizing large language models to automatically and effectively repair software vulnerabilities in real-world code, demonstrating high accuracy and improved code similarity.
Contribution
It presents a novel LLM-based iterative approach for vulnerability repair, including implementation details, evaluation with multiple LLMs, and publicly available resources for future research.
Findings
Achieved a human-verified quality score of 8.51/10
Increased groundtruth code similarity by 20% with Llama 3 70B
Demonstrated effectiveness across multiple state-of-the-art LLMs
Abstract
Software vulnerabilities continue to be ubiquitous, even in the era of AI-powered code assistants, advanced static analysis tools, and the adoption of extensive testing frameworks. It has become apparent that we must not simply prevent these bugs, but also eliminate them in a quick, efficient manner. Yet, human code intervention is slow, costly, and can often lead to further security vulnerabilities, especially in legacy codebases. The advent of highly advanced Large Language Models (LLM) has opened up the possibility for many software defects to be patched automatically. We propose LLM4CVE an LLM-based iterative pipeline that robustly fixes vulnerable functions in real-world code with high accuracy. We examine our pipeline with State-of-the-Art LLMs, such as GPT-3.5, GPT-4o, Llama 38B, and Llama 3 70B. We achieve a human-verified quality score of 8.51/10 and an increase in groundtruth…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWeb Application Security Vulnerabilities · Software System Performance and Reliability · Network Security and Intrusion Detection
