Privacy and Security Threat for OpenAI GPTs
Wei Wenying, Zhao Kaifa, Xue Lei, Fan Ming

TL;DR
This paper uncovers significant privacy and security vulnerabilities in OpenAI's GPT ecosystem, demonstrating widespread instruction leaking and data access issues in real-world custom GPTs, emphasizing the need for improved defenses.
Contribution
It systematically evaluates the scope of security and privacy threats in custom GPTs, developing attack methods and a framework to assess defense strategies in a large-scale real-world setting.
Findings
Over 98.8% of GPTs are vulnerable to instruction leaking attacks
Half of the GPTs can be attacked through multiround conversations
77.5% of GPTs with defenses remain vulnerable to basic attacks
Abstract
Large language models (LLMs) demonstrate powerful information handling capabilities and are widely integrated into chatbot applications. OpenAI provides a platform for developers to construct custom GPTs, extending ChatGPT's functions and integrating external services. Since its release in November 2023, over 3 million custom GPTs have been created. However, such a vast ecosystem also conceals security and privacy threats. For developers, instruction leaking attacks threaten the intellectual property of instructions in custom GPTs through carefully crafted adversarial prompts. For users, unwanted data access behavior by custom GPTs or integrated third-party services raises significant privacy concerns. To systematically evaluate the scope of threats in real-world LLM applications, we develop three phases instruction leaking attacks target GPTs with different defense level. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSecurity and Verification in Computing · AI in Service Interactions · Advanced Malware Detection Techniques
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Cosine Annealing · Linear Layer · Layer Normalization · Adam · Dense Connections · Linear Warmup With Cosine Annealing · Attention Dropout · Softmax · Weight Decay
