Defending Multimodal Fusion Models against Single-Source Adversaries
Karren Yang, Wan-Yi Lin, Manash Barman, Filipe Condessa, Zico Kolter

TL;DR
This paper reveals that multimodal neural networks are vulnerable to adversarial attacks on a single modality, and proposes a robust fusion strategy that detects and mitigates such attacks, significantly improving robustness across various tasks.
Contribution
The paper introduces a novel adversarially robust fusion method that enhances single-source robustness in multimodal models without sacrificing performance on clean data.
Findings
Standard multimodal models are vulnerable to single-source adversarial attacks.
The proposed method improves robustness by detecting and ignoring inconsistent modalities.
Significant performance gains on multiple tasks demonstrate effectiveness.
Abstract
Beyond achieving high performance across many vision tasks, multimodal models are expected to be robust to single-source faults due to the availability of redundant information between modalities. In this paper, we investigate the robustness of multimodal neural networks against worst-case (i.e., adversarial) perturbations on a single modality. We first show that standard multimodal fusion models are vulnerable to single-source adversaries: an attack on any single modality can overcome the correct information from multiple unperturbed modalities and cause the model to fail. This surprising vulnerability holds across diverse multimodal tasks and necessitates a solution. Motivated by this finding, we propose an adversarially robust fusion strategy that trains the model to compare information coming from all the input sources, detect inconsistencies in the perturbed modality compared to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Bacillus and Francisella bacterial research
