LLM-CSEC: Empirical Evaluation of Security in C/C++ Code Generated by Large Language Models

Muhammad Usman Shahid; Chuadhry Mujeeb Ahmed; Rajiv Ranjan

arXiv:2511.18966·cs.AI·November 25, 2025

LLM-CSEC: Empirical Evaluation of Security in C/C++ Code Generated by Large Language Models

Muhammad Usman Shahid, Chuadhry Mujeeb Ahmed, Rajiv Ranjan

PDF

Open Access

TL;DR

This paper evaluates the security of C/C++ code generated by large language models, revealing prevalent vulnerabilities and emphasizing the need for caution and further research in automated code generation.

Contribution

It provides an empirical analysis of vulnerabilities in LLM-generated C/C++ code, mapping CWEs to CVEs and highlighting security concerns.

Findings

01

High prevalence of CWEs in AI-generated code

02

Static analysis reveals significant security vulnerabilities

03

Calls for improved safety measures in code generation

Abstract

The security of code generated by large language models (LLMs) is a significant concern, as studies indicate that such code often contains vulnerabilities and lacks essential defensive programming constructs. This work focuses on examining and evaluating the security of LLM-generated code, particularly in the context of C/C++. We categorized known vulnerabilities using the Common Weakness Enumeration (CWE) and, to study their criticality, mapped them to CVEs. We used ten different LLMs for code generation and analyzed the outputs through static analysis. The amount of CWEs present in AI-generated code is concerning. Our findings highlight the need for developers to be cautious when using LLM-generated code. This study provides valuable insights to advance automated code generation and encourage further research in this domain.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSecurity and Verification in Computing · Advanced Malware Detection Techniques · Information and Cyber Security