AEGIS: From Clues to Verdicts -- Graph-Guided Deep Vulnerability Reasoning via Dialectics and Meta-Auditing
Sen Fang, Weiyuan Ding, Zhezhen Cao, Zhou Yang, Bowen Xu

TL;DR
AEGIS introduces a grounded, multi-agent framework for vulnerability detection that enhances verification accuracy by reconstructing dependency chains and scrutinizing claims, surpassing previous benchmarks with minimal cost.
Contribution
This work presents AEGIS, a novel multi-agent system that grounds vulnerability reasoning in a factual code repository, improving accuracy and reducing false positives without task-specific training.
Findings
Achieved state-of-the-art performance on PrimeVul with 122 correct predictions.
Reduced false positive rate by up to 54.40%.
Operates at an average cost of $0.09 per sample.
Abstract
Large Language Models (LLMs) are increasingly adopted for vulnerability detection, yet their reasoning remains fundamentally unsound. We identify a root cause shared by both major mitigation paradigms (agent-based debate and retrieval augmentation): reasoning in an ungrounded deliberative space that lacks a bounded, hypothesis-specific evidence base. Without such grounding, agents fabricate cross-function dependencies, and retrieval heuristics supply generic knowledge decoupled from the repository's data-flow topology. Consequently, the resulting conclusions are driven by rhetorical persuasiveness rather than verifiable facts. To ground this deliberation, we present AEGIS, a novel multi-agent framework that shifts detection from ungrounded speculation to forensic verification over a closed factual substrate. Guided by a "From Clue to Verdict" philosophy, AEGIS first identifies…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Authorship Attribution and Profiling · Software Engineering Research
