PenHeal: A Two-Stage LLM Framework for Automated Pentesting and Optimal Remediation
Junjie Huang, Quanyan Zhu

TL;DR
PenHeal is a novel two-stage LLM framework that automates penetration testing and vulnerability remediation, significantly improving coverage, effectiveness, and reducing costs in cybersecurity defenses.
Contribution
This paper introduces PenHeal, the first integrated LLM-based system for automated vulnerability detection and remediation in cybersecurity.
Findings
Vulnerability coverage increased by 31%.
Remediation effectiveness improved by 32%.
Cost reduction of 46% compared to baselines.
Abstract
Recent advances in Large Language Models (LLMs) have shown significant potential in enhancing cybersecurity defenses against sophisticated threats. LLM-based penetration testing is an essential step in automating system security evaluations by identifying vulnerabilities. Remediation, the subsequent crucial step, addresses these discovered vulnerabilities. Since details about vulnerabilities, exploitation methods, and software versions offer crucial insights into system weaknesses, integrating penetration testing with vulnerability remediation into a cohesive system has become both intuitive and necessary. This paper introduces PenHeal, a two-stage LLM-based framework designed to autonomously identify and mitigate security vulnerabilities. The framework integrates two LLM-enabled components: the Pentest Module, which detects multiple vulnerabilities within a system, and the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGroundwater flow and contamination studies · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification
