TL;DR
This large-scale empirical study assesses the security vulnerabilities of nearly 15,000 custom GPTs in the OpenAI ecosystem, revealing widespread weaknesses and highlighting the need for improved security measures.
Contribution
It provides the first large-scale, empirical analysis of custom GPT vulnerabilities, quantifying risks and identifying prevalent security issues in real-world deployments.
Findings
Over 95% of custom GPTs lack adequate security protections.
High prevalence of roleplay-based vulnerabilities, system prompt leakage, and phishing.
Inherent security weaknesses in foundational models are often inherited or amplified.
Abstract
Millions of users leverage generative pretrained transformer (GPT)-based language models developed by leading model providers for a wide range of tasks. To support enhanced user interaction and customization, many platforms-such as OpenAI-now enable developers to create and publish tailored model instances, known as custom GPTs, via dedicated repositories or application stores. These custom GPTs empower users to browse and interact with specialized applications designed to meet specific needs. However, as custom GPTs see growing adoption, concerns regarding their security vulnerabilities have intensified. Existing research on these vulnerabilities remains largely theoretical, often lacking empirical, large-scale, and statistically rigorous assessments of associated risks. In this study, we analyze 14,904 custom GPTs to assess their susceptibility to seven exploitable threats, such as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
