Backdoor Attacks on Multi-modal Contrastive Learning

Simi D Kuniyilh; Rita Machacy

arXiv:2601.11006·cs.LG·January 19, 2026


Open Access

TL;DR

This paper reviews backdoor attack vulnerabilities in contrastive learning, covering threat models, attack methods, and defenses, and discusses future research directions for securing these systems across domains.

Contribution

It provides a comprehensive comparison of backdoor attacks in contrastive learning and highlights specific vulnerabilities and challenges for future research.

Findings

01. Contrastive learning is vulnerable to backdoor and data poisoning attacks.

02. Current defenses are limited and need further development.

03. Understanding attack methods aids in designing more secure contrastive learning systems.

Abstract

Contrastive learning has become a leading self-supervised approach to representation learning across domains, including vision, multimodal settings, graphs, and federated learning. However, recent studies have shown that contrastive learning is susceptible to backdoor and data poisoning attacks. In these attacks, adversaries can manipulate pretraining data or model updates to insert hidden malicious behavior. This paper offers a thorough and comparative review of backdoor attacks in contrastive learning. It analyzes threat models, attack methods, target domains, and available defenses. We summarize recent advancements in this area, underline the specific vulnerabilities inherent to contrastive learning, and discuss the challenges and future research directions. Our findings have significant implications for the secure deployment of systems in industrial and distributed environments.


Taxonomy

Topics: Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning · Advanced Graph Neural Networks