LLM-CSEC: Empirical Evaluation of Security in C/C++ Code Generated by Large Language Models
By: Muhammad Usman Shahid, Chuadhry Mujeeb Ahmed, Rajiv Ranjan
Potential Business Impact:
Identifies security vulnerabilities in AI-generated C/C++ code, helping organizations gauge the risk of adopting LLM coding assistants.
The security of code generated by large language models (LLMs) is a significant concern: studies indicate that such code often contains vulnerabilities and lacks essential defensive programming constructs. This work examines and evaluates the security of LLM-generated code, with a particular focus on C/C++. We categorized known vulnerabilities using the Common Weakness Enumeration (CWE) and, to gauge their criticality, mapped them to CVEs. We generated code with ten different LLMs and analyzed each model's output using static analysis. The number of CWEs present in the AI-generated code is concerning. Our findings highlight the need for developers to exercise caution when using LLM-generated code. This study provides insights to advance automated code generation and encourages further research in this domain.
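The paper itself does not reproduce code here, but as a minimal illustrative sketch of the kind of missing defensive construct its CWE categorization targets, the C snippet below contrasts a classic unchecked buffer copy (CWE-120 buffer copy without checking size, which can lead to CWE-787 out-of-bounds write) with a bounds-checked rewrite. The function names and buffer size are hypothetical, not drawn from the study.

/* Illustrative only: an unchecked-copy weakness of the kind static
 * analyzers flag in generated C code, followed by a defensive rewrite.
 * greet_unsafe/greet_safe are hypothetical names, not from the paper. */
#include <stdio.h>
#include <string.h>

/* Vulnerable: no length check before copying into a fixed buffer. */
void greet_unsafe(const char *name) {
    char buf[16];
    strcpy(buf, name);              /* overflows if name is >= 16 bytes */
    printf("Hello, %s\n", buf);
}

/* Defensive: bound the copy and guarantee NUL termination. */
void greet_safe(const char *name) {
    char buf[16];
    strncpy(buf, name, sizeof buf - 1);
    buf[sizeof buf - 1] = '\0';     /* strncpy may leave buf unterminated */
    printf("Hello, %s\n", buf);
}

int main(void) {
    greet_safe("a deliberately long caller-supplied string");
    return 0;
}

Off-the-shelf static analyzers such as cppcheck or the Clang Static Analyzer can flag patterns like the unchecked strcpy above; the abstract does not name the specific toolchain used, so this is only an example of the methodology, not the authors' setup.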
Similar Papers
The Hidden Risks of LLM-Generated Web Application Code: A Security-Centric Evaluation of Code Generation Capabilities in Large Language Models
Cryptography and Security
Evaluates security weaknesses in web application code generated by LLMs.
WildCode: An Empirical Analysis of Code Generated by ChatGPT
Cryptography and Security
Empirically analyzes the security of code generated by ChatGPT in real-world use.
Assessing the Quality and Security of AI-Generated Code: A Quantitative Analysis
Software Engineering
Quantitatively assesses the quality and security of AI-generated code.