LLMs in Web Development: Evaluating LLM-Generated PHP Code Unveiling   Vulnerabilities and Limitations

Rebeka T\'oth; Tamas Bisztray; L\'aszl\'o Erdodi

arXiv:2404.14459·cs.SE·May 22, 2024·3 cites

LLMs in Web Development: Evaluating LLM-Generated PHP Code Unveiling Vulnerabilities and Limitations

Rebeka T\'oth, Tamas Bisztray, L\'aszl\'o Erdodi

PDF

Open Access 1 Repo

TL;DR

This paper assesses the security vulnerabilities in PHP code generated by GPT-4 for web applications, revealing significant risks like SQL injection and XSS, and emphasizing the need for rigorous testing of AI-generated code.

Contribution

It provides a comprehensive evaluation of security flaws in GPT-4 generated PHP code, highlighting prevalent vulnerabilities and offering a dataset for further research.

Findings

01

26% of sites had exploitable vulnerabilities

02

78% of file upload scenarios were insecure

03

11.56% of sites could be compromised directly

Abstract

This study evaluates the security of web application code generated by Large Language Models, analyzing 2,500 GPT-4 generated PHP websites. These were deployed in Docker containers and tested for vulnerabilities using a hybrid approach of Burp Suite active scanning, static analysis, and manual review. Our investigation focuses on identifying Insecure File Upload, SQL Injection, Stored XSS, and Reflected XSS in GPT-4 generated PHP code. This analysis highlights potential security risks and the implications of deploying such code in real-world scenarios. Overall, our analysis found 2,440 vulnerable parameters. According to Burp's Scan, 11.56% of the sites can be straight out compromised. Adding static scan results, 26% had at least one vulnerability that can be exploited through web interaction. Certain coding scenarios, like file upload functionality, are insecure 78% of the time,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

beckyntosh/chatphp
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSemantic Web and Ontologies · Wikis in Education and Collaboration · Digital Rights Management and Security

MethodsAttention Is All You Need · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Absolute Position Encodings · Dropout · Dense Connections · Label Smoothing · Residual Connection · Softmax · Adam