RAVEN: Retrieval-Augmented Vulnerability Exploration Network for Memory Corruption Analysis in User Code and Binary Programs

Parteek Jamwal; Minghao Shao; Boyuan Chen; Achyuta Muthuvelan; Asini Subanya; Boubacar Ballo; Kashish Satija; Mariam Shafey; Mohamed Mahmoud; Moncif Dahaji Bouffi; Pasindu Wickramasinghe; Siyona Goel; Yaakulya Sabbani; Hakim Hacid; Mthandazo Ndhlovu; Eleanna Kafeza; Sanjay Rawat; Muhammad Shafique

arXiv:2604.17948·cs.CR·April 21, 2026

RAVEN: Retrieval-Augmented Vulnerability Exploration Network for Memory Corruption Analysis in User Code and Binary Programs

Parteek Jamwal, Minghao Shao, Boyuan Chen, Achyuta Muthuvelan, Asini Subanya, Boubacar Ballo, Kashish Satija, Mariam Shafey, Mohamed Mahmoud, Moncif Dahaji Bouffi, Pasindu Wickramasinghe, Siyona Goel, Yaakulya Sabbani, Hakim Hacid, Mthandazo Ndhlovu, Eleanna Kafeza, Sanjay Rawat

PDF

TL;DR

RAVEN is a framework that uses large language models and retrieval-augmented generation to automatically produce detailed vulnerability analysis reports from source code, enhancing cybersecurity documentation.

Contribution

The paper introduces RAVEN, a novel LLM-based system with retrieval and evaluation modules for automated vulnerability report generation, addressing a gap in cybersecurity automation.

Findings

01

RAVEN achieved an average report quality score of 54.21%.

02

The framework effectively covers 15 CWE vulnerability types.

03

Evaluation on 105 samples demonstrates promising results for automated documentation.

Abstract

Large Language Models (LLMs) have demonstrated remarkable capabilities across various cybersecurity tasks, including vulnerability classification, detection, and patching. However, their potential in automated vulnerability report documentation and analysis remains underexplored. We present RAVEN (Retrieval Augmented Vulnerability Exploration Network), a framework leveraging LLM agents and Retrieval Augmented Generation (RAG) to synthesize comprehensive vulnerability analysis reports. Given vulnerable source code, RAVEN generates reports following the Google Project Zero Root Cause Analysis template. The framework uses four modules: an Explorer agent for vulnerability identification, a RAG engine retrieving relevant knowledge from curated databases including Google Project Zero reports and CWE entries, an Analyst agent for impact and exploitation assessment, and a Reporter agent for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.