When One Modality Rules Them All: Backdoor Modality Collapse in Multimodal Diffusion Models

Qitong Wang; Haoran Dai; Haotian Zhang; Christopher Rasmussen; Binghui Wang

arXiv:2603.06508·cs.LG·March 9, 2026

When One Modality Rules Them All: Backdoor Modality Collapse in Multimodal Diffusion Models

Qitong Wang, Haoran Dai, Haotian Zhang, Christopher Rasmussen, Binghui Wang

PDF

Open Access

TL;DR

This paper uncovers a phenomenon in multimodal diffusion models where backdoor attacks tend to rely on only a subset of modalities, revealing a vulnerability that contradicts the assumption of combined modality effects.

Contribution

The paper introduces two novel metrics to quantify backdoor modality reliance and demonstrates the prevalence of modality collapse in multimodal diffusion models through extensive experiments.

Findings

01

Backdoor attacks often collapse into subset-modality dominance.

02

Cross-modal interaction in backdoor behavior is negligible or negative.

03

High attack success rates can mask reliance on a single modality.

Abstract

While diffusion models have revolutionized visual content generation, their rapid adoption has underscored the critical need to investigate vulnerabilities, e.g., to backdoor attacks. In multimodal diffusion models, it is natural to expect that attacking multiple modalities simultaneously (e.g., text and image) would yield complementary effects and strengthen the overall backdoor. In this paper, we challenge this assumption by investigating the phenomenon of Backdoor Modality Collapse, a scenario where the backdoor mechanism degenerates to rely predominantly on a subset of modalities, rendering others redundant. To rigorously quantify this behavior, we introduce two novel metrics: Trigger Modality Attribution (TMA) and Cross-Trigger Interaction (CTI). Through extensive experiments across diverse training configurations in multimodal conditional diffusion, we consistently observe a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Artificial Intelligence in Games · Advanced Malware Detection Techniques