Demonstration of an Adversarial Attack Against a Multimodal Vision   Language Model for Pathology Imaging

Poojitha Thota; Jai Prakash Veerla; Partha Sai Guttikonda; Mohammad S.; Nasr; Shirin Nilizadeh; Jacob M. Luber

arXiv:2401.02565·eess.IV·May 9, 2024·1 cites

Demonstration of an Adversarial Attack Against a Multimodal Vision Language Model for Pathology Imaging

Poojitha Thota, Jai Prakash Veerla, Partha Sai Guttikonda, Mohammad S., Nasr, Shirin Nilizadeh, Jacob M. Luber

PDF

Open Access 1 Repo

TL;DR

This paper demonstrates that a medical vision-language model is highly vulnerable to adversarial attacks, achieving a 100% success rate in inducing misclassifications, highlighting critical security concerns in medical AI applications.

Contribution

It introduces the first targeted adversarial attack on a pathology vision-language model, revealing its susceptibility and emphasizing the need for robust defenses in medical AI.

Findings

01

100% success rate in adversarial manipulation

02

Insights into interpretability challenges of adversarial examples

03

Highlighting the importance of robustness in medical vision-language models

Abstract

In the context of medical artificial intelligence, this study explores the vulnerabilities of the Pathology Language-Image Pretraining (PLIP) model, a Vision Language Foundation model, under targeted attacks. Leveraging the Kather Colon dataset with 7,180 H&E images across nine tissue types, our investigation employs Projected Gradient Descent (PGD) adversarial perturbation attacks to induce misclassifications intentionally. The outcomes reveal a 100% success rate in manipulating PLIP's predictions, underscoring its susceptibility to adversarial perturbations. The qualitative analysis of adversarial examples delves into the interpretability challenges, shedding light on nuanced changes in predictions induced by adversarial manipulations. These findings contribute crucial insights into the interpretability, domain adaptation, and trustworthiness of Vision Language Models in medical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jaiprakash1824/vlm_adv_attack
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Healthcare and Education · Adversarial Robustness in Machine Learning