UIBDiffusion: Universal Imperceptible Backdoor Attack for Diffusion   Models

Yuning Han; Bingyin Zhao; Rui Chu; Feng Luo; Biplab Sikdar; Yingjie; Lao

arXiv:2412.11441·cs.CR·March 3, 2025

UIBDiffusion: Universal Imperceptible Backdoor Attack for Diffusion Models

Yuning Han, Bingyin Zhao, Rui Chu, Feng Luo, Biplab Sikdar, Yingjie, Lao

PDF

Open Access 1 Repo

TL;DR

UIBDiffusion introduces a universal, imperceptible backdoor attack for diffusion models using adversarial perturbations, achieving high effectiveness and stealthiness while evading current defenses across various datasets and models.

Contribution

The paper proposes a novel universal imperceptible backdoor attack for diffusion models using adversarial perturbations, enhancing stealthiness and effectiveness compared to prior methods.

Findings

01

Achieves high attack success rate with low poison rates

02

Universal triggers effective across different images and models

03

Can bypass state-of-the-art defenses like Elijah and TERD

Abstract

Recent studies show that diffusion models (DMs) are vulnerable to backdoor attacks. Existing backdoor attacks impose unconcealed triggers (e.g., a gray box and eyeglasses) that contain evident patterns, rendering remarkable attack effects yet easy detection upon human inspection and defensive algorithms. While it is possible to improve stealthiness by reducing the strength of the backdoor, doing so can significantly compromise its generality and effectiveness. In this paper, we propose UIBDiffusion, the universal imperceptible backdoor attack for diffusion models, which allows us to achieve superior attack and generation performance while evading state-of-the-art defenses. We propose a novel trigger generation approach based on universal adversarial perturbations (UAPs) and reveal that such perturbations, which are initially devised for fooling pre-trained discriminative models, can be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

TheLaoLab/UIBDiffusion
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning

MethodsDiffusion