Demonstration of an Adversarial Attack Against a Multimodal Vision Language Model for Pathology Imaging
Poojitha Thota, Jai Prakash Veerla, Partha Sai Guttikonda, Mohammad S., Nasr, Shirin Nilizadeh, Jacob M. Luber

TL;DR
This paper demonstrates that a medical vision-language model is highly vulnerable to adversarial attacks, achieving a 100% success rate in inducing misclassifications, highlighting critical security concerns in medical AI applications.
Contribution
It introduces the first targeted adversarial attack on a pathology vision-language model, revealing its susceptibility and emphasizing the need for robust defenses in medical AI.
Findings
100% success rate in adversarial manipulation
Insights into interpretability challenges of adversarial examples
Highlighting the importance of robustness in medical vision-language models
Abstract
In the context of medical artificial intelligence, this study explores the vulnerabilities of the Pathology Language-Image Pretraining (PLIP) model, a Vision Language Foundation model, under targeted attacks. Leveraging the Kather Colon dataset with 7,180 H&E images across nine tissue types, our investigation employs Projected Gradient Descent (PGD) adversarial perturbation attacks to induce misclassifications intentionally. The outcomes reveal a 100% success rate in manipulating PLIP's predictions, underscoring its susceptibility to adversarial perturbations. The qualitative analysis of adversarial examples delves into the interpretability challenges, shedding light on nuanced changes in predictions induced by adversarial manipulations. These findings contribute crucial insights into the interpretability, domain adaptation, and trustworthiness of Vision Language Models in medical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · Adversarial Robustness in Machine Learning
