An Empirical Study on the Security Vulnerabilities of GPTs

Tong Wu; Weibin Wu; Zibin Zheng

arXiv:2512.00136·cs.CR·December 2, 2025

An Empirical Study on the Security Vulnerabilities of GPTs

Tong Wu, Weibin Wu, Zibin Zheng

PDF

Open Access

TL;DR

This paper empirically investigates security vulnerabilities in GPT-based systems, analyzing attack surfaces, demonstrating potential exploits, and proposing defense mechanisms to enhance their security and responsible use.

Contribution

It provides a comprehensive attack surface analysis and systematic attack suite for GPTs, along with defense strategies, filling a gap in empirical security research on large language models.

Findings

01

Identification of key security vulnerabilities in GPT components

02

Demonstration of information leakage and tool misuse attacks

03

Proposed defense mechanisms to mitigate identified vulnerabilities

Abstract

Equipped with various tools and knowledge, GPTs, one kind of customized AI agents based on OpenAI's large language models, have illustrated great potential in many fields, such as writing, research, and programming. Today, the number of GPTs has reached three millions, with the range of specific expert domains becoming increasingly diverse. However, given the consistent framework shared among these LLM agent applications, systemic security vulnerabilities may exist and remain underexplored. To fill this gap, we present an empirical study on the security vulnerabilities of GPTs. Building upon prior research on LLM security, we first adopt a platform-user perspective to conduct a comprehensive attack surface analysis across different system components. Then, we design a systematic and multidimensional attack suite with the explicit objectives of information leakage and tool misuse based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsWeb Application Security Vulnerabilities · Spam and Phishing Detection · Scientific Computing and Data Management