AEGIS: From Clues to Verdicts -- Graph-Guided Deep Vulnerability Reasoning via Dialectics and Meta-Auditing

Sen Fang; Weiyuan Ding; Zhezhen Cao; Zhou Yang; Bowen Xu

arXiv:2603.20637·cs.SE·March 24, 2026

AEGIS: From Clues to Verdicts -- Graph-Guided Deep Vulnerability Reasoning via Dialectics and Meta-Auditing

Sen Fang, Weiyuan Ding, Zhezhen Cao, Zhou Yang, Bowen Xu

PDF

Open Access

TL;DR

AEGIS introduces a grounded, multi-agent framework for vulnerability detection that enhances verification accuracy by reconstructing dependency chains and scrutinizing claims, surpassing previous benchmarks with minimal cost.

Contribution

This work presents AEGIS, a novel multi-agent system that grounds vulnerability reasoning in a factual code repository, improving accuracy and reducing false positives without task-specific training.

Findings

01

Achieved state-of-the-art performance on PrimeVul with 122 correct predictions.

02

Reduced false positive rate by up to 54.40%.

03

Operates at an average cost of $0.09 per sample.

Abstract

Large Language Models (LLMs) are increasingly adopted for vulnerability detection, yet their reasoning remains fundamentally unsound. We identify a root cause shared by both major mitigation paradigms (agent-based debate and retrieval augmentation): reasoning in an ungrounded deliberative space that lacks a bounded, hypothesis-specific evidence base. Without such grounding, agents fabricate cross-function dependencies, and retrieval heuristics supply generic knowledge decoupled from the repository's data-flow topology. Consequently, the resulting conclusions are driven by rhetorical persuasiveness rather than verifiable facts. To ground this deliberation, we present AEGIS, a novel multi-agent framework that shifts detection from ungrounded speculation to forensic verification over a closed factual substrate. Guided by a "From Clue to Verdict" philosophy, AEGIS first identifies…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Malware Detection Techniques · Authorship Attribution and Profiling · Software Engineering Research