Prompt Injection Attacks on Large Language Models in Oncology

Jan Clusmann; Dyke Ferber; Isabella C. Wiest; Carolin V. Schneider,; Titus J. Brinker; Sebastian Foersch; Daniel Truhn; Jakob N. Kather

arXiv:2407.18981·cs.CR·March 20, 2025·5 cites

Prompt Injection Attacks on Large Language Models in Oncology

Jan Clusmann, Dyke Ferber, Isabella C. Wiest, Carolin V. Schneider,, Titus J. Brinker, Sebastian Foersch, Daniel Truhn, Jakob N. Kather

PDF

Open Access

TL;DR

This paper reveals that current vision-language models used in healthcare, especially in oncology, are vulnerable to prompt injection attacks that can cause them to output harmful information without access to their internal parameters.

Contribution

The study provides a comprehensive quantitative analysis of prompt injection vulnerabilities in four state-of-the-art medical vision-language models, highlighting a critical security flaw.

Findings

01

All four models are susceptible to prompt injection attacks.

02

Embedding sub-visual prompts can induce harmful outputs.

03

Attacks are non-obvious to human observers.

Abstract

Vision-language artificial intelligence models (VLMs) possess medical knowledge and can be employed in healthcare in numerous ways, including as image interpreters, virtual scribes, and general decision support systems. However, here, we demonstrate that current VLMs applied to medical tasks exhibit a fundamental security flaw: they can be attacked by prompt injection attacks, which can be used to output harmful information just by interacting with the VLM, without any access to its parameters. We performed a quantitative study to evaluate the vulnerabilities to these attacks in four state of the art VLMs which have been proposed to be of utility in healthcare: Claude 3 Opus, Claude 3.5 Sonnet, Reka Core, and GPT-4o. Using a set of N=297 attacks, we show that all of these models are susceptible. Specifically, we show that embedding sub-visual prompts in medical imaging data can cause…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Machine Learning in Healthcare · Natural Language Processing Techniques

MethodsSparse Evolutionary Training