Invisible Backdoor Attack against Self-supervised Learning

Hanrong Zhang; Zhenting Wang; Boheng Li; Fulin Lin; Tingxu Han; Mingyu; Jin; Chenlu Zhan; Mengnan Du; Hongwei Wang; Shiqing Ma

arXiv:2405.14672·cs.CV·April 4, 2025

Invisible Backdoor Attack against Self-supervised Learning

Hanrong Zhang, Zhenting Wang, Boheng Li, Fulin Lin, Tingxu Han, Mingyu, Jin, Chenlu Zhan, Mengnan Du, Hongwei Wang, Shiqing Ma

PDF

Open Access 2 Repos

TL;DR

This paper introduces a novel, imperceptible backdoor attack against self-supervised learning models that is highly effective, stealthy, and resistant to existing defenses, by designing optimized triggers aligned with data augmentation.

Contribution

The paper develops a new backdoor attack method for SSL models using optimized triggers that are imperceptible and disentangled from data augmentation, overcoming limitations of previous approaches.

Findings

01

Attack is highly effective across multiple datasets and SSL algorithms.

02

The attack remains stealthy and resistant to existing defenses.

03

Proposed triggers are imperceptible to human vision and aligned with data augmentation.

Abstract

Self-supervised learning (SSL) models are vulnerable to backdoor attacks. Existing backdoor attacks that are effective in SSL often involve noticeable triggers, like colored patches or visible noise, which are vulnerable to human inspection. This paper proposes an imperceptible and effective backdoor attack against self-supervised models. We first find that existing imperceptible triggers designed for supervised learning are less effective in compromising self-supervised models. We then identify this ineffectiveness is attributed to the overlap in distributions between the backdoor and augmented samples used in SSL. Building on this insight, we design an attack using optimized triggers disentangled with the augmented transformation in the SSL, while remaining imperceptible to human vision. Experiments on five datasets and six SSL algorithms demonstrate our attack is highly effective and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications