LLM vs. SAST: A Technical Analysis on Detecting Coding Bugs of GPT4-Advanced Data Analysis

Madjid G. Tehrani; Eldar Sultanow; William J. Buchanan; Mahkame Houmani; Christel H. Djaha Fodja

arXiv:2506.15212·cs.CR·June 19, 2025

LLM vs. SAST: A Technical Analysis on Detecting Coding Bugs of GPT4-Advanced Data Analysis

Madjid G. Tehrani, Eldar Sultanow, William J. Buchanan, Mahkame Houmani, Christel H. Djaha Fodja

PDF

Open Access

TL;DR

This paper compares GPT-4's effectiveness to traditional SAST tools in detecting software vulnerabilities, demonstrating GPT-4's superior accuracy and discussing security considerations for LLMs in vulnerability scanning.

Contribution

It provides a comprehensive analysis of GPT-4's capabilities in security vulnerability detection, highlighting its outperformance over SAST and addressing security best practices.

Findings

01

GPT-4 achieves 94% accuracy in vulnerability detection.

02

GPT-4 outperforms SAST across 32 vulnerability types.

03

Security concerns of LLMs are discussed with recommended best practices.

Abstract

With the rapid advancements in Natural Language Processing (NLP), large language models (LLMs) like GPT-4 have gained significant traction in diverse applications, including security vulnerability scanning. This paper investigates the efficacy of GPT-4 in identifying software vulnerabilities compared to traditional Static Application Security Testing (SAST) tools. Drawing from an array of security mistakes, our analysis underscores the potent capabilities of GPT-4 in LLM-enhanced vulnerability scanning. We unveiled that GPT-4 (Advanced Data Analysis) outperforms SAST by an accuracy of 94% in detecting 32 types of exploitable vulnerabilities. This study also addresses the potential security concerns surrounding LLMs, emphasising the imperative of security by design/default and other security best practices for AI.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Data Storage Technologies