Backdoor Attack on Unpaired Medical Image-Text Foundation Models: A   Pilot Study on MedCLIP

Ruinan Jin; Chun-Yin Huang; Chenyu You; Xiaoxiao Li

arXiv:2401.01911·cs.CV·January 5, 2024·1 cites

Backdoor Attack on Unpaired Medical Image-Text Foundation Models: A Pilot Study on MedCLIP

Ruinan Jin, Chun-Yin Huang, Chenyu You, Xiaoxiao Li

PDF

Open Access 1 Repo

TL;DR

This paper investigates the vulnerability of unpaired medical image-text foundation models, specifically MedCLIP, to backdoor attacks, revealing significant security risks and the ineffectiveness of current defenses.

Contribution

It introduces a novel backdoor attack framework targeting unpaired medical FMs and demonstrates its effectiveness against MedCLIP and other models.

Findings

01

Backdoor attacks can be launched with minimal label discrepancies.

02

Current defenses are inadequate against these backdoor threats.

03

The attack disrupts contrastive learning via embedding manipulation.

Abstract

In recent years, foundation models (FMs) have solidified their role as cornerstone advancements in the deep learning domain. By extracting intricate patterns from vast datasets, these models consistently achieve state-of-the-art results across a spectrum of downstream tasks, all without necessitating extensive computational resources. Notably, MedCLIP, a vision-language contrastive learning-based medical FM, has been designed using unpaired image-text training. While the medical domain has often adopted unpaired training to amplify data, the exploration of potential security concerns linked to this approach hasn't kept pace with its practical usage. Notably, the augmentation capabilities inherent in unpaired training also indicate that minor label discrepancies can result in significant model deviations. In this study, we frame this label discrepancy as a backdoor attack problem. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ubc-tea/backdoor_multimodal_foundation_model
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning

MethodsSparse Evolutionary Training · Contrastive Learning