Multi-role Consensus through LLMs Discussions for Vulnerability   Detection

Zhenyu Mao; Jialong Li; Dongming Jin; Munan Li; Kenji Tei

arXiv:2403.14274·cs.SE·May 21, 2024·1 cites

Multi-role Consensus through LLMs Discussions for Vulnerability Detection

Zhenyu Mao, Jialong Li, Dongming Jin, Munan Li, Kenji Tei

PDF

Open Access 1 Repo

TL;DR

This paper proposes a multi-role LLM-based discussion framework simulating real-life code review roles to improve vulnerability detection, achieving significant gains in precision, recall, and F1 score.

Contribution

It introduces a novel multi-role discussion approach with LLMs for vulnerability detection, incorporating diverse perspectives from developers and testers.

Findings

01

13.48% increase in precision

02

18.25% increase in recall

03

16.13% increase in F1 score

Abstract

Recent advancements in large language models (LLMs) have highlighted the potential for vulnerability detection, a crucial component of software quality assurance. Despite this progress, most studies have been limited to the perspective of a single role, usually testers, lacking diverse viewpoints from different roles in a typical software development life-cycle, including both developers and testers. To this end, this paper introduces a multi-role approach to employ LLMs to act as different roles simulating a real-life code review process and engaging in discussions toward a consensus on the existence and classification of vulnerabilities in the code. Preliminary evaluation of this approach indicates a 13.48% increase in the precision rate, an 18.25% increase in the recall rate, and a 16.13% increase in the F1 score.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rockmao45/llmvulndetection
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNetwork Security and Intrusion Detection · Web Application Security Vulnerabilities