An Investigation into Misuse of Java Security APIs by Large Language   Models

Zahra Mousavi; Chadni Islam; Kristen Moore; Alsharif Abuadbba,; Muhammad Ali Babar

arXiv:2404.03823·cs.CR·April 8, 2024·1 cites

An Investigation into Misuse of Java Security APIs by Large Language Models

Zahra Mousavi, Chadni Islam, Kristen Moore, Alsharif Abuadbba,, Muhammad Ali Babar

PDF

Open Access 1 Repo

TL;DR

This study evaluates ChatGPT's ability to correctly generate secure Java code using security APIs, revealing a high rate of API misuse that compromises software security.

Contribution

It provides a systematic assessment of ChatGPT's trustworthiness in security API code generation, highlighting prevalent misuse issues and identifying specific vulnerabilities.

Findings

01

Approximately 70% of generated code misuses security APIs

02

20 distinct types of API misuse were identified

03

Misuse rate reaches 100% in half of the evaluated tasks

Abstract

The increasing trend of using Large Language Models (LLMs) for code generation raises the question of their capability to generate trustworthy code. While many researchers are exploring the utility of code generation for uncovering software vulnerabilities, one crucial but often overlooked aspect is the security Application Programming Interfaces (APIs). APIs play an integral role in upholding software security, yet effectively integrating security APIs presents substantial challenges. This leads to inadvertent misuse by developers, thereby exposing software to vulnerabilities. To overcome these challenges, developers may seek assistance from LLMs. In this paper, we systematically assess ChatGPT's trustworthiness in code generation for security API use cases in Java. To conduct a thorough evaluation, we compile an extensive collection of 48 programming tasks for 5 widely used security…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

llm-security-study/chatgpt
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Engineering Research · Advanced Malware Detection Techniques · Network Security and Intrusion Detection