Vulnerability-Aware Robust Multimodal Adversarial Training
Junrui Zhang, Xinyu Zhao, Jie Peng, Chenjie Wang, Jianmin Ji, Tianlong Chen

TL;DR
This paper introduces VARMAT, a vulnerability-aware adversarial training method that enhances multimodal model robustness by explicitly identifying and penalizing vulnerable modalities during training.
Contribution
The paper proposes a novel vulnerability quantification and targeted regularization approach for robust multimodal training, addressing limitations of existing methods.
Findings
Achieves up to 22.21% robustness improvement on multiple datasets.
Effectively identifies modality vulnerabilities to improve robustness.
Demonstrates significant robustness gains over baseline methods.
Abstract
Multimodal learning has shown significant superiority on various tasks by integrating multiple modalities. However, the interdependencies among modalities increase the susceptibility of multimodal models to adversarial attacks. Existing methods mainly focus on attacks on specific modalities or indiscriminately attack all modalities. In this paper, we find that these approaches ignore the differences between modalities in their contribution to final robustness, resulting in suboptimal robustness performance. To bridge this gap, we introduce Vulnerability-Aware Robust Multimodal Adversarial Training (VARMAT), a probe-in-training adversarial training method that improves multimodal robustness by identifying the vulnerability of each modality. To be specific, VARMAT first explicitly quantifies the vulnerability of each modality, grounded in a first-order approximation of the attack…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Hate Speech and Cyberbullying Detection · Advanced Neural Network Applications
