Exploring ChatGPT's Capabilities on Vulnerability Management
Peiyu Liu, Junming Liu, Lirong Fu, Kangjie Lu, Yifan Xia, Xuhong, Zhang, Wenzhi Chen, Haiqin Weng, Shouling Ji, Wenhai Wang

TL;DR
This paper evaluates ChatGPT's ability to perform complex vulnerability management tasks, comparing its performance to state-of-the-art methods, and identifies challenges and future research directions for improving its effectiveness in security applications.
Contribution
It is the first comprehensive study assessing ChatGPT's capabilities across multiple vulnerability management tasks using a large dataset, highlighting its potential and limitations.
Findings
ChatGPT shows promising potential in generating bug report titles.
Random demonstration examples do not guarantee consistent performance.
Extracting expertise from demonstrations and guiding ChatGPT effectively are promising future directions.
Abstract
Recently, ChatGPT has attracted great attention from the code analysis domain. Prior works show that ChatGPT has the capabilities of processing foundational code analysis tasks, such as abstract syntax tree generation, which indicates the potential of using ChatGPT to comprehend code syntax and static behaviors. However, it is unclear whether ChatGPT can complete more complicated real-world vulnerability management tasks, such as the prediction of security relevance and patch correctness, which require an all-encompassing understanding of various aspects, including code syntax, program semantics, and related manual comments. In this paper, we explore ChatGPT's capabilities on 6 tasks involving the complete vulnerability management process with a large-scale dataset containing 70,346 samples. For each task, we compare ChatGPT against SOTA approaches, investigate the impact of different…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software Reliability and Analysis Research · Advanced Malware Detection Techniques
MethodsFocus
